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ABSTRACT 


A  new  approximation  technique  to  a  certain  class  of  nonlinear  filtering 
problems  Is  considered^  this  repoi^>  The  method  Is  based  on  an  approxima¬ 
tion  of  nonlinear,  partially  observable  systems  by  a  stochastic  control 
problem  with  fully  observable  state.  The  filter  development  proceeds  from  the 
assumption  that  the  unobservables  are  conditionally  Gaussian  with  respect  to 
the  observations  initially.  The  concepts  of  both  conditionally  Gaussian 
processes  and  an  optimal -control  approach  to  filtering  are  utilized  In  the 
filter  development.  A  two-step,  nonlinear,  recursive  estimation  procedure 
(TNF),  compatible  with  the  logical  structure  of  the  optimal  mean-square  esti¬ 
mator,  generates  a  finite-dimensional,  nonlinear  filter  with  Improved  charac¬ 
teristics  over  most  of  the  traditional  methods.  Moreover,  a  “close"  (In  the 
mean-square  sense)  approximation  for  the  original  system  will  be  generated  as 
well.  In  general  the  nonlinear  filtering  problem  does  not  have  a  finite¬ 
dimensional  recursive  synthesis.  Thus,  the  proposed  technique  may  expand  the 
range  of  practical  problems  that  can  be  handled  by  nonlinear  filtering..  A 
detailed  derivation  for  the  filter  with  global  property  Is  presented.  Exten¬ 
sion  of  the  results  to  large-scale  nonlinear  systems  Is  accomplished  ^Incor¬ 
porating  a  novel  decomposition  scheme  In  the  filter  design. 

Application  of  the  developed  filter  to  a  scaJjH*~iTonTTnear  system  which 
lacks  model  "smoothness"  Is  presented  in  [K2],  ^Application  of  the  derived 
multi -dimensional  filtering  algorithm  to  two  low-order,  nonlinear  tracking 
problems  according  to  a  global  criterion  and  a  local-time  criterion  respec¬ 
tively  are  presented.  Also,  a  comparison  with  traditional  methods,  such  as 
the  popular  Extended-jUt  ~<an  Filter  (EKF),  are  given  via  digital -computer 
simulation  to  demonstrate  the  effectiveness  of  the  obtained  results. 
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NOTATIONS 


The  following  notations  will  be  used  throughout  this  dissertation: 
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a  , 

F  , 
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p  . 

(a,  F,  P)  , 
tr  {.)  , 

E  (.)  , 


Euclidean  n  -  dimensional  space 
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Sub  o-algebra  c^F 
Probability  measure 
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II  II  , 
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Euclidean  norm 
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Refer  to  equation  or  set  of  equations  where  1 
denotes  the  chapter,  and  j  Is  the  serial 
(sequential)  nwbcr  of  that  equation 


First-order  derivative  with  respect  to  t  (time) 
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Second-order  derivative  (Jacobian  matrix) 


x  , 
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(m.s.e.)  , 
RHS  , 


Random  variable 

Stochastic  random  process 
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W1 ener-process  sequence 
Mean  square  error 
Root  mean  square 
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1.  INTRODUCTION 


The  behavior  of  natural  phenomena  in  general  does  not  follow 
strict  linear  deterministic  laws.  Modem  technology,  with  its  great 
refinement  of  instrumentation,  has  made  it  abundantly  evident  tha 
the  formulations  of  natural  laws  in  which  troublesome  nonlinear  ter 
are  suppressed  or  neglected  to  achieve  workable  linearizations,  oft< 
lead  to  faulty  results  or  to  sacrifices  of  precision,  which  are  . 
longer  tolerable.  Consequently,  the  simple  mathematical  determin¬ 
istic  model  must  be  modified.  This  can  be  achieved  by  using  a  sto¬ 
chastic  nonlinear  model  representation  where  the  dynamic  behavior  of 
a  physical  system  Is  formulated  In  terms  of  the  evolution  of  the 
state  (xt)  of  the  system,  under  the  Influence  of  random  disturbance 
Ut),  as  a  solution  of  a  set  of  stochastic  differential  equations: 

x  »  f(t,xt,  Ct),teT  ,  (1-1) 

where  f  (.)  Is  a  nonlinear,  real  n-vector  function; 

xt  -  a  stochastic  process  n-vector  which  usually  cannot  be 
measured  directly, 

t  -  the  time  variable,  tc[0,  T], 

Ct  -a  vector  stochastic  process  to  model  the  random 
disturbance. 

To  avoid  confusion,  the  following  notation  will  be  adapted  In 
the  sequel.  By  writing  xt,  one  Interprets  x  as  a  stochastic  process, 
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while  x(t)  will  mean  that  x  is  a  deterministic  function  of  time. 
Moreover,  (x{t)),  (xt)  will  mean  deterministic  and  stochastic 
sequences  respectively. 

An  Important  soecial  class  of  (1-1)  is  the  stochastic  differen¬ 
tial  equation  with  the  random  disturbance  modeled  as  an  additive 
white  Gaussian  noise  flt,  which  has  well-known  characteristics. 
Hence,  a  white-noise  model  Is  used  here  with 

f(xt,  t)  +  G(xt,  t)8t  .  (1-2) 

It  Is  Important  to  note  here  that  this  model  may  lead  to  mathematical 
difficulties  In  the  case  of  classical  Integrals.  However,  we  may 
replace  the  white-noise  model,  (1-2),  by  an  equivalent  "Wiener- 
process"  model  as  follows: 

dxt  -  f(*t,t)dt  +  G(xt,t)dw  ,  (1-3) 

which  Is  Interpreted  In  the  Ito  sense  [LI].  More  detailed  discussion 
of  the  above  equivalent  formulations  Is  given  in  Appendix  A. 

In  most  practical  problems,  the  states  of  the  system,  which 
model  Its  dynamical  behavior,  connot  be  observed  directly;  only  noisy 
versions  can  be  measured.  This  noise  Is  due  to  the  unavoidable  meas¬ 
urement  error,  and  In  most  cases  Is  modeled  for  convenience  either  as 
an  additive  Gaussian  process  or  as  an  additive  Wiener  process.  Con- 


3 


sequently,  a  filtering  algorithm  may  be  employed  to  extract  and  esti¬ 
mate  reliably  the  states  of  the  system  from  the  measured  values. 
Stated  another  way,  a  given  system  has  Its  dynamics  modeled  by  the 
following  version  of  (1-1): 

xt*  yt*  *t^  *  (1*4) 

where  f(.),  t,  xt,  are  as  defined  before.  Here  yt,  the  observable 
states,  (can  be  measured  directly)  evolve  according  to  the  following 
nonlinear  differential  equations: 

&  *  H(t,  xt,  yt,  vt)  .  '  (1-5) 

Again,  H(.)  Is  a  nonlinear,  real  n-vector  function,  and  vt  Is  a 
vector  stochastic  process  modeling  the  observation  noises,  which  may 
be  caused  mainly  by  measurement  noises  and  environmental  effects. 

Assume  that  (1-4),  (1-5)  can  be  solved  for  each  realization  of 
€,v.  Then,  realization  of  x,y  which  are  also  of  a  stochastic  nature, 
are  defined.  Furthermore,  to  completely  define  the  filtering  prob¬ 
lem,  a  performance  criterion  wst  be  stated,  which  defines  in  what 
sense  the  estimate  should  follow  the  state.  In  this  research,  the 
mean-square  criterion  Is  adapted  because  of  both  Its  mathematical 
convenience,  and  the  distance  measure  It  provides  the  estimate  to 
follow  "closely*  the  original  state.  Hence, 
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J(xt,  xt)  ■  E  /  (l!xt  -  xtir)dt 


(1-6) 


Here,  E(.),  Il.ll,*tare  the  expectation,  the  Euclidean  norm,  and 
the  “best"  estimate  of  xt  respectively.  This  particular  criterion 
was  chosen  to  ensure  a  “global"  filtering  criterion.  However,  other 


criteria  might  be  used  depending  on  the  application  of  Interest. 


The  filter  equation  can  be  modeled  as: 


xt»  K  (y(s);  sc[0,t])  ,  (1-7) 

where  xt  ,  the  estimate.  Is  defined  once  the  structure  of  the  K 
operator  Is  obtained.  Thus,  If  we  assuae  that  E(||xt  ll2)<»» 
te[0,T],  then  It  Is  well  known  [D2]  that  xt  ,  the  best  estimate.  Is 
generated  by  the  following  formula: 


xt  ■  E(xt/ys;  se[0,t])  ,  (1-8) 

where  E(/)  denotes  the  conditional  expectation.  Moreover,  with  addi¬ 
tional  assumptions  about  the  structure  of  (1-4),  and  (1-5),  a  recur¬ 
sive  version  of  (1-8),  (closed  form)  can  be  found  (l.e.  the  estimated 
value  of  at  time  (t+At)  can  be  generated  by  a  recursive  formula 
If  given  the  value  of  the  estimate  xt  at  time  t  and  the  observations 
(ys;  sc(t,  t+a))).  This  Is  Immediately  recognized  within  the  class 
of  linear  systems  with  linear  observations  and  additive  Gaussian 
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noises,  as  the  optimal  (In  m.s.e),  state  estimator  and  consists  of  a 
finite  dimension*!  linear  filter.  The  latter  is  due  to  the  Gaussian 
assumption  In  the  system  which  permits  the  conditional  probability 
density  function  to  be  completely  characterized  by  only  the  condl- 
tlonal  mean  and  the  conditional  covariance  [D2] .  However,  for  non¬ 
linear  systems  this  fortunate  situation  does  not  generally  exist. 
Obviously,  nonlinear  filtering  techniques  are  more  general  and 
greatly  expand  the  range  of  practical  problems  which  can  be 
handled.  But,  its  optimal  estimator  generally  consists  of  an 
Infinite-dimensional  system  of  moments,  or  equivalently  a  partial 
differential  equation  that  has  an  Infinite  number  of  dimensions. 
Consequently,  approximation  and  ad-hoc  techniques  usually  are 
employed  to  construct  practical  filters  for  nonlinear  systems.  The 
classical  methods  described  in  recent  literature  for  realizable 
nonlinear  filters  can  be  roughly  classified  Into  two  categories. 
They  are  either  probabilistic  or  statistical  approaches.  In  the 
statistical  approaches,  the  basic  Idea  Is  to  linearize  the  nonlinear 
equation;  then  the  Kalman-Bucy  method  Is  applied.  The  linearization, 
which  Is  a  first  order  power-series  expansion  of  the  nonlinear  terms. 
Is  generally  performed  either  with  respect  to  a  given  “reference 
trajectory"  or  with  respect  to  an  estimator  that  Is  obtained  by 
filtering,  such  as  the  case  In  the  Extended  Kalman  Filer  EKF.  But, 
these  approaches  have  some  drawbacks.  For  Instance,  In  the  first 
case  the  true  trajectory  should  be  close  to  the  assured  one  which  Is 


sometimes  hard  to  fulfill.  In  the  second  case,  since  the  filter 
parmeters  are  functions  of  the  state  estimate,  an  error  In  the 
estimate  Impacts  filter  gain  and  can  result  In  filter  bias. 
Inconsistency,  and  even  divergence.  In  the  probability  approach, 
which  was  started  by  Stratonavich  [SI]  and  developed  by  Kushner  [K4], 
Bucy  [Bl],  Jazwlnskl  [Jl],— to  mention  a  few,  the  general  procedures 
are  as  follows:  First,  the  equation  of  evolution  for  the  conditional 
probability  density  functions  are  determined.  Then,  equations  of  the 
conditional  moments  are  developed.  Finally,  various  heuristic 
assumptions  and  arbitrary  truncation  schemes  are  applied  to  the 
evolving  Infinite  dimensional  filter  equations  to  generate  finite 
versions. 

The  basic  common  assumption  that  Is  used  In  the  filtering 
algorithms  mentioned  above.  Is  the  requirement  of  the  “differentia¬ 
bility"  or  “smoothness”  of  the  nonlinear  terms.  This  results  In 
replacing  the  global  properties  of  the  filter  by  local  properties, 
and  "derivatives"  which  are  further  aggravated  by  noise.  This 
assumption  restricts  the  range  of  direct  application  (unless  they  are 
heurlstlcally  modified)  of  the  above  algorithms. 

It  Is  well  known  that  linear  filtering  Is  of  paramount  Impor¬ 
tance  to  sonar  (active  and  passive)  and  radar  applications. 
Estimation  In  sonar  Is  often  associated  with  the  localization  of  a 
target  which  has  already  been  detected.  Localization  Is  essentially 
a  parameter  estimation  problem  where  the  parameters  of  Interest 
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typically  are  target  range,  Doppler  (radial  velocity)  and  azimuth 
angle.  The  sonar  "measures"  one  or  more  of  these  quantities  as  a 
function  of  time  using  the  observed  sensor  data  to  obtain  a  history 
of  the  target  track  for  surveillance  or  fire  control.  In  general, 
the  measured  quantities  are  nonlinear  functions  of  the  localization 
parameters  so  that  nonlinear  estimation  techniques  mist  be  used  to 
establish  the  target  track.  Various  manual  -  deterministic  and 
automated  -  sequential  methods  such  as  the  EKF  are  currently  In 
use.  There  are  obvious  shortcomings  of  the  EKF  for  passive  tracking, 
such  as  the  "111  conditioning"  of  the  error  covariance  matrix  due  to 
false  observability  which  causes  the  filter  divergence.  Detailed 
studies  of  the  EKF  shortcomings  are  given  In  [A2],  and  [Cl]. 
Consequently,  the  development  of  a  "global"  or  "local  time"  nonlinear 
filter  that  does  not  suffer  from  the  above  shortcomings  would  be 
quite  attractive,  specifically  to  sonar  applications.  The  proposed 
nonlinear  filters  developed  herein  do  have  the  above  features. 
Hence,  applicability  to  sonar  applications  must  be  fully  explored. 

This  dissertation  Introduces  new  finite-dimensional  filtering 
structures  for  a  certain  class  of  nonlinear  systems  which  offers: 

1.  Better  filtering  accuracy  than  the  traditional  techniques. 

2.  Does  not  require  the  "smoothness”  assumption  of  the  existing 
techniques. 

A  global  filtering  criterion. 


3. 


It  Is  the  general  goal  of  this  work  to  devise  a  means  of  coping 
with  many  of  the  drawbacks  of  the  exlsltlng  techniques.  The  object 
of  this  thesis  Is  the  design  of  a  new  nonlinear  filtering  approx¬ 
imation.  The  filter  structure  Is  recursive,  easily  Implementable, 
efficient,  and  finite  dimensional.  The  concept  of  both  conditionally 
Gaussian  processes  and  an  optimal  control  approach  to  filtering  are 
used  In  the  filter  derivation.  The  method  of  solution  Is  based  on  a 
result  of  Llptser  and  Schlryayev  [L2]  which  was  rigorously  extended 
to  the  vector  case  by  Kolodzlej  [Kl],  This  approach  combines  the 
advantages  of  a  sound  theoretical  basis,  generation  of  an 
approximation  model  for  the  original  nonlinear  system,  and  generation 
of.  a  finite-dimensional  nonlinear  filter  which  has  certain  Improved 
characteristics  over  most  classical  methods,  such  as  the  popular  EKF. 

The  suggested  technique,  which  could  be  called  ”an  approximation 
In  the  parameter  space,"  consists  of  three  basic  concepts: 
approximate  fee<fcack  control  modeling,  conditionally  Gaussian 
filtering,  and  control  law  computation.  The  use  of  the 
conditionally -Gaussian  concept,  which  was  formally  Introduced  by 
tipster  and  Shlryayev  [12]  allows  a  closed  system  of  equations  for 
calculating  recursively  xt  ■  E{xt/yt)  and  rt-  Cov  (xt/yt)  which 

completely  characterize  the  conditional  Gausslan-dlstrlbutlon 
function  Plx^a/y^),  t*Q.  That  provides  a  certain  class  of 
stochastic  nonlinear  systems  with  the  same  tools  as  the  Gaussian 
assumption  provides  for  linear  systems. 


9 


The  main  advantage  of  this  approach  over  the  traditional 
techniques  are:  First,  the  basic  and  most  common  asswptlon  of  the 
■differentiability"  or  ■smoothness"  of  the  nonlinear  terms  Is  not 
required  here.  This  Is  Important  for  filter  stability  and 
susceptabillty  to  noise  aggravation;  second,  the  model  Itself  Is 
approximated  rather  than  approximating  the  evolving  filter  equations, 
as  Is  the  case  In  most  of  the  existing  filter  methods.  Consequently, 
one  has  more  flexibility  In  adjusting  the  approximating  parameters 
and  end  up  with  a  good  (In  m.s.e.  sense)  approximation  model,  as  well 
as  an  approximated  finite-dimensional  nonlinear  filter.  That  Is  not 
the  case  when  the  filter  equations  are  approximated  directly,  due  to 
the  strict  requirements  of  the  filter  theory.  Finally,  In  most 
linearization  techniques  the  available  observations  are  used  only 
through  the  Innovation  process  In  the  filtering  algorithm  so  valuable 
Information  may  be  wasted.  Herein,  more  complete  use  of  the 
available  observations  are  undertaken  through  the  concept  of 
conditioning  the  process  on  the  given  Information  as  well  as 
channeling  It  back  through  the  Innovations  process. 

The  dissertation  organization  Is  as  follows:  chapter  one  Is 
mainly  tutorial  In  the  sense  that  it  reviews  briefly  the  significant 
techniques  that  are  used  In  nonlinear  filtering  theory.  It  also 
Includes  the  definition  of  the  goals,  the  scope  of  this  research,  and 
an  outline  of  the  organization  of  the  dissertation.  In  chapter  two, 
a  formal  definition  of  the  nonlinear  filtering  problem  is  given,  and 
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the  structure  of  an  optimal  finite-dimensional  nonlinear  filtering 

approximation  Is  developed.  In  chapter  three,  a  major  feature,  a 

"  decomposition  scheme,11  Is  Incorporated  In  design  strategy  of 
filtering  a  certain  class  of  large-scale  nonlinear  systems.  Chapter 
four  presents  application  examples  to  Illustrate  the  proposed 
filtering  algorithm  via  digital  computer  simulations.  A  comparison 
with  traditional  techniques  such  as  the  EKF  and  the  Modified 
Truncation  Second-Order  Filter  MSOF  (when  It  Is  applicable)  are  also 
given.  This  comparison  Is  based  on  performance,  accuracy  and  cost  of 
computation  and  storage  requirements.  Finally,  Chapter  five  presents 
concluding  remarks,  and  comments  on  future  areas  of  research. 

Appendix  A  presents  a  brief  discussion  of  the  equivalence 

between  the  Wiener-process  formulation  and  the  white-noise 
formulation  In  modeling  stochastic  differential  equations.  Appendix 
B  Is  Intended  to  be  an  easy  reference  to  two  Lemmas  used  In  this 


text. 
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2.  AN  APPROXIMATE  FINITE-DIMENSIONAL 
NONLINEAR  FILTER 


In  this  chapter  the  nonlinear  filtering  problem  Is  defined,  and 
the  derivation  of  a  bilinear  feedback -control -model  approximation 
with  Its  correspondl ng  finite-dimensional  filter  Is  presented.  In 
general,  the  suggested  technique  uses  extensively  three  basic 
concepts,  bilinear  feedback  modeling,  control -law  computation,  and 
condltlonally-Gausslan  filtering.  Hence,  these  three  significant 
concepts  are  formally  Introduced.  The  new  filter.  In  general, 
consists  of  two  major  steps.  In  the  first  step,  the  given  system  Is 
approximated  by  a  bilinear,  feedback -control  model.  In  the  second 
step,  the  state  estimator  (m.s.e.)  Is  computed  using  the 
condltlonally-Gausslan-fllter  format.  Thereafter,  It  will  be 
referred  to  as  the  two-step  nonlinear  filter  (TNF). 

2-1  A  Nonlinear  Filtering  Problem 

Consider  as  given  some  conplete  probability  space  (a,  F,  P)  with 
a  nondecreasing,  family  of  sub-c-algebra  c^F,  te[0,T].  Let  (wt1), 

1  *  1,2, . n  be  mutually  Independent  Wiener  processes  comprising  a 

vector  of  dimension  n.  Also,  given  a  nonlinear  model  with  dynamics 
described  by  a  family  of  stochastic  differential  equations  (In 
the  Ito  sense)  of  the  form 

dzt  ■  f(t,z)  dt  +  o(t,z)dwt  . 


(2-1) 
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Here,  te  (0,T),  z  e  C  the  space  of  continuous  functions  <  Rn,  and 
f  :  [0,T]  x  Rn  Rn  nonlinear,  real  n-vector  functions, 
a  :  [0,T]  x  Rn  ♦  Rn  x  Rn  matrix. 

The  nonlinear  functions  Involved  In  equation  (2-1)  are  assured  to 
have  appropriate  properties  to  guarantee  the  existence,  uniqueness, 
and  continuity  of  sample  solution  with  probability  one  (LI).  These 
sample  solution  properties  are  crucial  for  modeling  a  physical 
practical  system  by  (2-1)  and  essential  for  simulating  the 
corresponding  model  by  digital  computer. 

Suppose  now  that  z  c  Rn  Is  written  In  terms  of  two  components 
**(x.y).  where  y  c  R*f  x  c  Rn"*.  Correspondingly,  f,  a  can  be 
written  as 

f  •  (F,H),  a  • 

where  F,H  are  (n-m),  m  dimensional  vectors  respectively.  G,R  are  of 
dimensions  (n-m)  x  n,  m  x  n  respectively.  The  vector  y  represents 
the  noisy  partial  observations  that  are  made  of  the  whole  process. 
Thus,  the  observation  e-fleld  is  the  sub  o-fleld  of  zt  defined  by  Yt 
■  e-algebra  (ys;  Ctfs<t).  Notice  that  If  m  »  n  the  system  In  (2-1)  Is 
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completely  observable.  Hence,  the  system  In  (2-1)  can  be  decomposed 
as  follows: 

dx^  «  F  (z,t)  dt  +  G  (z,t)  d*t  ,  (2-2-a) 

(2-2) 

dyt  *  H  (z,t)  dt  +  R  (z,t)  dwt.  (2-2-b) 

Here  again  (2-2-a),  (2-2-b)  are  Interpreted  In  the  Ito  sense  (LI). 

The  system  In  (2-2)  is  nonlinear,  then  the  optimal  estimate  for 
a  minimal -variance  criterion,  (l.e.  mean-square-error  sense).  Is 
known  to  be  the  conditional  expectation  of  the  state  of  the  system 
given  the  observation,  {y$;  0<s<t),  te[0,T]. 

*t  ■  E  (xt/y$;  sc  (0,t))  ,  (2-3) 

where  E(/)  denotes  the  conditional  expectation.  In  principle  a 
sequential  version  of  (2-3)  can  be  found,  (l.e.  the  estimate 
*t+at  at  ^  +  can  be  9enerat*d  recursively  If  given  the 

value  of  the  estimate  xt  at  time  t  and  the  observations  (y$  ; 
scCt,t+at]).  But,  In  general,  the  recursive  formulae  consist  of  an 
Infinite-dimensional  system  of  moment  equations  which  are  needed  to 
coapletely  characterize  the  conditional  probability  density  p(xt, 
t/ys  ;  sc[0,t]).  Thus,  an  approximation  must  be  made  for  practical 
Implementation.  A  new  approximation  method  will  be  developed  In  the 
following  sections.  * 
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2-2  A  Bilinear  Feedback  Approximation  Model 

A  bilinear  control  model  which  approximates  the  nonlinear  system 
In  (2-2)  Is  presented  here.  This  technique  is  called  "an 
approximation  In  the  model  parameter  space."  Its  parameters  are 
functions  of  the  feedback  control  law  ut  which  is  Itself  a  functional 
of  the  observation  processes  yt. 

In  general,  the  model  can  be  approximated  as  follows: 

dxt  “  dt  +  G  (xt»  “t*^  ^t* 

(2-4) 

dyt  ■  H  (xt,ut,t)dt  +  R  (xt,ut,t)dwt  , 

m  m  m 

where  (F(.),  G(.),  H(.),  R(.))  are  functionals  of  ut.  The  control  ut 
Is  measurable  with  respect  to  Yt«o-algebra  (ys;  0$s*t),  the 
stochastic  process  defined  on  te[0,T],  and  It  Is  chosen  to  minimize 
the  following  "mean-square  global"  criterion: 

Q(u)  -  E(/T||F  -  FJI2  dt)  ,  (2-5) 

o 

where  Ft  denotes  any  of  the  functions  F,  H,  R,  or  G  In  (2-2),  and 
Ft  Is  Its  corresponding  approximation  In  (2-4).  Here,  the  norm 
II. 1 1  Is  a  Euclidean  norm,  and  the  arguments  (xt,  yt,t),  (xt,ut,t) 
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are  omitted  for  brevity.  It  should  be  noted  that  the  choice  of  this 
performance  index  (2-5)  ensures  a  form  of  global  filtering  even 
though,  the  search  for  the  approximation  falls  into  a  class  of 
stochastic-control  problems  and  depends  strongly  on  the  type  of 
nonl Inearl  ties  in  the  system.  However,  In  [K3]  it  was  emphasized 
that  using  a  "local*  filtering  criterion  will  eliminate  the 
dependence  of  the  approximating  parameters  on  the  particular  form  of 
nonlinearities  in  the  system.  Thus,  It  Is  a  trade  off  between 
filtering  properties  and  the  conplexity  of  the  search  for  the 
approximation  parameters. 

An  important  special  class  of  (2-4)  Is  the  following  bilinear 

form: 


dxt  *  [A(ut,t)xt  +  B(ut,t)]dt  +  G(ut,t)dwt, 

(2-6) 

dyt  «  CC(ut,t)xt  +  0(ut,t)]dt  +  R  (ut,t)dwt, 

where  (A,B,C,0,G,R)  are  of  appropriate  dimensions,  and  are  linear 
functionals  of  ut.  The  term  bilinear  refers  to  the  fact  that  the 
system  Is  linear  In  the  control  and  state,  but  not  jointly  linear 
[M2],  how  assune  ut  which  minimizes  (2-5)  is  available;  then  (2-6) 
Is  a  "close"  approximation  model  to  the  original  system  in  (2-2),  and 
the  minimization  criterion  Is  a  measure  of  the  quality  of  the 
approximation. 
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Remarks 

(I)  Although  the  bilinear  structure  In  (2-6)  requires  the  Identifi¬ 
cation  of  more  parameters  than  the  ones  In  the  original  system 
In  (2-2),  It  Is  mathematically  more  convenient  for  the 
derivation  of  the  filter. 

(II)  A  unique,  explicit,  general  form  for  (A,  B,  C,  D,  G,  and  R  ) 
cannot  be  given,  because  they  depend  strongly  on  the  type  of 
nonl Inearl  ties  In  the  original  system. 

(III)  The  model  may  Include  a  broad  class  of  non-Gausslan  noise 
source,  since  non-Gausslan  or  even  nonstationary  processes  may 
be  modeled  by  Wiener  noise  passed  through  an  appropriate 
nonlinear  filter  of  this  class. 

(1v)  The  use  of  the  feedback  control  In  the  approximation  will 
couple  the  filter  equations  (l.e.  the  covariance  equations  will 
be  functions  of  the  estimates).  But  unlike  the  other 
linearization  techniques  this  may  enhance  the  stability  of  the 
filter  due  to  the  feedback  structure. 
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2-3  Conditionally  Gaussian  Processes 

An  Important  concept  which  Is  used  extensively  In  this  research 
Is  the  conditional  Gaussian  concept.  Llpster  and  Shiryayev  [L2] 
formally  define  It  as  follows: 

Theorem  2-1: 

Let  (with  probability  one  p.a.s)  the  conditional  distribution 
P(xg*a0/y0),  aQeR  be  Gaussian,  Njmg.rg)  with  (Xro^*  Then  the 
random  process  (xt,yt),  (XtiT,  satisfying  a  diffusion  type  of 
equations  as  In  (2-6)  where  the  parameters  satisfying  conditions  (11- 
4)-(l  1-11)  of  [L2],  Is  conditionally  Gausslon  such  as:  for  any  t>0, 
0<t0<t1....tn<t,  the  conditional  distributions 

Fy^(x0,...,xn)  *  P(xt^a0, ...,xt  -$an/ct) 
are  (p.a.s)  Gaussian. 

The  proof  of  this  theorem  is  very  long  and  is  given  In  (L2). 
This  result  Is  very  Important  since  it  allows  a  closed  system  of 
equations  for  generating  recursively  xt*  E(xt/yt),  and 
rt»  Cov  (x^/y^)  .  (This  is  obtained  by  replacing  the  complex 
confutation  of  the  conditional  expectation  In  (2-3)  by  a  simple 
Integration.)  And  these  two  parameters  completely  characterize  the 
conditional  distribution  P(xt<a/yt),  t*0.  So,  It  provides  a  class  of 
stochastic  nonlinear  systems  with  the  same  tools  that  the  Gaussian 
assunptlon  provides  for  linear  systems. 
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Subsequently,  for  this  class  of  nonlinear  systems,  the  concept  led  to 
the  development  of  a  finite  conditionally -Gaussian  filter  by  Lipster 
and  Shiryayev  [12] .  It  also,  offers  more  flexibility  in  control 
applications  than  do  linearization  techniques.  This  advantage  has 
been  demonstrated  by  Kolodzlej  [Kl]  and  Mohler  and  Kolodzlej  [Ml]. 

It  should  be  emphasized  here  that  in  application  the  necessary 
assumption  of  Xq  given  to  be  conditionally  Gaussian  can  be 
satisfied  under  realistic  operating  conditions.  This  may  result  from 
a  physical  consideration  or  from  a  direct  approximation  of  the 
distribution  of  xq  given  yQ  by  a  Gaussian  process.  In  the  first 
case,  for  example,  the  error  of  the  estimate  of  xQ  given  yQ  might  be 
caused  by  many  randan  phenomena  which  In  turn  might  be  approximated 
by  a  Gaussian  process.  Nevertheless,  this  does  not  necessarily  mean 
that  either  xQ  or  yQ  has  to  be  Gaussian. 

2-4  A  Conditionally  Gaussian  Filter 

This  finite-dimensional  filter  is  derived  by  Lipster  and 
Shiryayev  [L2]  and  is  rigorously  extended  to  the  nultl-dimenslonal 
case  by  Kolodzlej  [Kl]  who  also,  relaxed  some  of  the  required 
conditions  suggested  earlier  In  [L2] . 

To  suranarlze  their  results,  consider  the  system  in  (2-6)  which 
Is  partially  observable.  At  any  time  t  It  is  desired  to  estimate  the 
unobservable  state  xt  using  realizations  of  the  observation  state 
yt.  Let  (x0,yQ)  be  the  initial  states  for  (2-6)  which  are  assuned  to 
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be  Independent  of  the  Wiener  processes  wj,1»l,2.  The  parameters 
(A( . ) ,  8(.),  C(.),  0(.),  G(.),  R(.)  )  are  of  the  appropriate 

dimensions  and  their  elements  are  assumed  to  be  measurable  non- 
antlclpatlve  functionals  on  [0,T]xCTn.  Also,  assuae  that  ut  Is  a 
measurable  functional  of  Yt  where  Yt  is  the  o-algebra  generated  by 
[ys;  Oss^t].  Then,  sufficient  conditions  for  derivation  of  a 
recursive  optimal,  mean-square  estimate  of  xt  given  (ys;  0*s<t)  are 
given  below.  For  all  u  e  Cjn, 


T  , 

/  1 1  A(t,u) 1 1 1  dt  $  •,  and 

0 


(2-7) 


/T[| lB(t)| |4+| |C(t,u)| |4  +  1 |D(t,u)| I2  +  G(t,u>l l4]dt<-  .  (2-8) 

0 

* 

For  u,neCtn,  te[0,T]  define 


R2(t,u)  »  R(t,u)  R*(t,u)  , 


then  I iR"2(t,u) 1 1  $  c  <  •  . 


(2-9) 


Also,  assume  that 

I  lR(t,u)  -  R(t,n)||2s<  c(/T  I  ( u(  s )  -  n(s)ll2dk(s) 

0 

+  ||u(t)  -  n(t)l I2)  , 
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and  (2-10) 

1 1  R(t,u)  1 1 2  tf  c  (q/T(1  +  Ilu(s)||2  dK(s)  ♦  1  +  |  lu(t)|  |2), 

where  K(s)  is  a  nondecreasing  right-continuous  function  0$  K(s)  ^  1; 
c  Is  a  positive  constant.  The  following  coaments  are  In  order: 

(I)  conditions  (2-7)  -  (2-10)  ensure  the  existence  and  uniqueness 
of  a  uniform  parabolic  solution  to  the  system  In  (2-6)  which  is 
Important  for  real  system  modeling  and  simulation  by  digital 
computer; 

(II)  conditions  (2-7),  (2-8)  are  assuaed  to  assert  that  (xt,yt)  are 
square  Integrate.  For  example,  A(t,ut)  *  x2  will  violate 
these  conditions,  however  A(t,ut)  ■  /T  will  satisfy  these 
conditions.  These  conditions  also.  Imply  that 

JT| |B(t,u)| |4dt  <  •  . 

0 

This  Is  Important  since  It  will  restrict  the  additive 

stochastic  control  to  the  square  Integrable  class  that 

satisfies  E(  JT  llujl4  dt)<  -  ; 

0  1 

(III)  condition  (2-9)  Is  made  so  that  no  degenerate  stochastic 
measure  will  be  associated  with  Y.  Otherwise,  no  uniform 
parabolic  solution  exists  for  the  system  In  (2-6)  If  a  noise 
tern  Is  missing  In  the  equations  of  system  (2-6); 


(iv)  The  conditions  In  (2-10)  restrict  the  noise  coefficients  R  in 
(2-6)  to  a  class  of  smooth  functions  of  ut,  (due  to  the  first 
equation  of  (2-10)  which  Is  a  generalized  Lipschltz 
condition).  And  the  second  equation  of  (2-10)  ensures  that  its 
rate  of  growth  Is  limited  to  at  most  linear  growth  of  ut.  This 
Is  important  so  that  the  solutions  (xt,yt)  do  not  "explode"  in 
finite  time. 

From  Theorem  (2-1)  of  [Kl]  the  following  results  are  quoted: 
Theorem  2-2 

Let  equations  (2-6)  have  a  weak  solution,  (see  Appendix  A  for 
definition  of  strong  and  weak  solutions),  (xt,yt),  te  [0,T]  with  the 
Initial  states  (x0,y0)  satisfying 

E  Cllx0ll4]  <-  . 

(2-11) 

P(  1 1  y0l I  <  -)  -  i  . 

Let  the  conditional  distribution  P(xQ  4  ag/y0)  be  (P.a.s)  Gaussian 
with  parameters  *g  ■  E(Xg/y0).  Tg*  EC(Xg-Xg)(x0-x0)*/yg]  and 
tr(fg)  <  •  p.a.s.  If  conditions  [(2-7)  -  (2-10)]  are  satisfied,  then 
the  processes  (xt,yt),  satisfying  (2-6),  t*[0,T]  are  conditionally 
Gaussian,  l.e.  for  aqy  tc[0,T]  and  any  finite  partition  tj, 
>0,1,... k  of  [0,T]  such  that  0  <  tg  ^....t*  N<  t,  the  conditional 


distribution 
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P(xtO*  °0*  xtl*  al****xtk  *  at/ytJ  *  a  e  R" 

is  p.a.s.  Gaussian,  Yt  is  0-algebra  generated  by  {y$;  0  4  s  ^  t}. 
Further,  xt,  rt,  i.e.  xt  ■  (xt/yt)  and  rt  *  E[(xt-  xt)(xt-xt)*/yt3, 
are  unique  continuous  solutions  to 

dxt»  (Axt  +  B)dt  +  Kdv, 

K  «  (rtC*)  (RR*  )‘°‘5, 

dv  -  (RR*r°*5(dyt  -  (Cxt  +  0)dt),  (2-12) 

drt  »  (ATt  +  rtA*+  GG*-  KK*)dt, 

with  Xg  *  x(Q),  rQ»r(0)  as  initial  conditions. 

The  proof  of  the  theorem  is  parallel  to  the  proof  given  in  [Kl] 
and  [LI].  The  interested  reader  is  referred  to  the  given  references 
for  details.  The  equations  in  (2-12)  are  recursive  formulae  (filter) 
for  an  optimal  mean-square  estimate  of  xt.  The  arguments  (t,u)  are 
omitted  again  for  brevity.  The  finite  dimensionality  of  this  filter 
Is  due  to  the  conditional -Gaussian  assumption  of  the  processes 
(xt,yt).  A  schematic  representation  of  the  filter  is  given  In  Figure 
2-1  to  realize  a  functional  understanding  of  its  design. 
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2-5  A  Stochastic  Nonlinear  Control  Problem 

Following  Llpster  and  Shlryayev  [L2]  the  partially  observable 
system  (2-6)  Is  transformed  Into  a  completely  observable  system  (2- 
12).  Consequently  the  minimization  criterion  (2-5)  also  must  be 
transformed  In  order  to  solve  for  the  control  law  analytically.  Let 
the  ut  *  1>(t,y)  where  ^tCO.TlxC+R*1  such  that  ♦  Is  nonantlclpatlve, 
(l.e.  T(t,y)  *  T(t,y*)  If  y(s)  »  y*(s)  for  s  <4  t).  Now,  If  <|>  Is 
nonlinear,  (xt,yt)  which  satisfy  (2-6)  will  no  longer  be  a  normal 
process,  but  the  conditional  distribution  of  xt  given  {ys;  s  4  t}  Is 
still  normal  with  mean  and  covariance  given  by  (2-12). 

Let  *  (5,t,x,rt)  denote  a  differentiable  Gaussian  measure  with 
mean  x  ,  and  variance  rt. 

Oef 1 ne 


L  (t,  xt,  rt,  ut)  3  /  L  (t,  C,  u)  d  ♦  (5,  t,  x,  rt)  ,  (2-13) 

where  L  denotes  l!F  -  F  II*,  and  F,  P  are  as  defined  before. 

If  rt  Is  nonsingular,  then 


d  «  (e,  t,  x,  rj 


t'  "  7TTW77 


(2v),,/t|rtl 


Exp  -  0.5  C(C-x)*r"l(?-x)]  d?  , 

(2-14) 


where  lrtl  denotes  the  determinant  of  rt,  and  r*1  Is  the  inverse 


matrix  of  the  covariance 
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Now,  using  the  properties  of  expectation,  the  Bayes  formula,  and 
the  definitions  in  (2-13)  and  (2-14),  the  cost  function  In  (2-5)  can 
be  written  as 


T  -  , 

Q(u)  -  E(J  E(| IF  -  F| | z/y  ;  sn)dt), 
0  5 

T  - 

*  E  /  L(t,  xt,  rt,  ut)  dt. 


(2-15) 


The  Interchange  of  Integration  and  conditional  expectation  in  (2-15) 
Is  permitted  by  a  version  of  Fublnl's  theorem  [Rl],  and  It  Is 
justified  if  we  are  dealing  with  Integrals  of  Gaussian  random 
processes. 

Thus,  an  equivalent  completely  observable  system  (2-12),  (2-15) 
emerges  where  the  new  states  of  the  system  (X,  rt)  are  generated  by 
(2-12).  But  the  parameters  of  this  equivalent  system  are  functionals 
of  the  control  law.  Consequently,  we  have  to  solve  a  control-law 
problem  first.  Thus,  the  filtering  problem  Is  actually  replaced  by  a 
stochastic-control  problem,  which  results  In  a  stochastic  Bellman 
equation.  Apparently,  this  could  lead  to  a  more  difficult  problem  to 
solve  than  the  original  filter  problem  for  (2-2).  But  as  has  been 
shown  In  [K2]  an  approximation  to  the  control -law  can  be  found 
whlthout  solving  the  Bellman  equation.  In  the  mean  time,  the 
control-law  problem  Is  formulated  as  follows.  Let  z  »  ,  a 
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vector  of  appropriate  dimensions.  Then  dz 
12)  can  be  written  as 


\  / 


and  hence  (2- 


dz  «  f(z,u,t)dt  +  o(t,u)dv  , 


where 


f(z,  u,  t) 


A(t,u)x  +  B(t,u)  > 

Aft  +  +  GG*-  KK 

"  / 


(2-16) 


o(t,u) 


^k(t.u)  0 

0  0  * 

V  J 


Here  the  matrices  are  of  appropriate  dimensions. .  But  equation 
(2-16)  Is  a  "degenerate"  equation  because  the  noise  term  Is  missing 
In  the  covariance  equation.  Thus,  as  pointed  out  In  the  remark  about 
condition  (2-9),  no  uniform  parabolic  solution  exist  for  the  system 
because  o(t,u)  <x*(t,u)  Is  singular  which  Implies  the  existence  of 
singular  probability  measure  for  X,  r^.  Hence,  the  approximation 
of  a  degenerate  system  of  stochastic  equation  proposed  by  Fleming 
[FI]  which  satisfies  condition  (2-9)  Is  adapted  here,  where  small 
white-noise  terms  may  be  added  to  the  covariance  equation.  Thus  (2- 
16)  becomes 


dz#-  f(z,  u,  t)  dt  +  odw. 


(2-17) 


r  \ 

(  \ 

K(t,u)  0 

dv 

where 

7  « 

0  (2e)0,5 

,  dw  * 

dw 

16). 

k  / 

>  f( •)  Is  as  defined  in  (2- 


For  the  stochastic  control  problem  (2-15),  (2-17),  it  has  been 
shown  by  Fleming  [FI],  Davis  [Dl],  and  Ahmed  and  Tio  [Al]  that  there 
exists  an  optimal  control  for  the  system  described  by  (2-15),  (2-17) 
if  the  following  conditions  are  satisfied:  Let  Q  a  (Tq  x  T)xRn  where 
To  *  T; 

(I)  7,  f  are  Llpschltz  continuous  In  z,  (z  »  (x  ,  rt)  ; 

(II)  <F  Is  nonantlclpatlve  with  respect  to  w^,  i  »  1,2; 

(III)  f,  L  are  bounded  measurable  on  Q  (the  closure  of  Q)  for  each 
ueu,  where  u  Is  the  set  of  admlssable  controls  which  Is 

m 

continuous  for  every  (t,z)  e  Q  ; 

( 1 v)  f,  L  are  convex  for  each  (t,z)  eQ; 

(v)  a(t,z)  *  tr(aa*)  Is  nonsingular. 

Using  dynamic  programming  it  is  seen  that  there  exists  a  value 
function  V(t,  x,  r)  which  Is  differentiable  at  least  once  In  t  and 
twice  In  z,  and  which  satisfies: 

T 

V(t,z)  »  Inf  (Et  /  L(t,  z,  u)dt)  .  (2-18) 

u(0,T)  t 


The  corresponding  stochastic  Bellman  equation  Is 


28 

Vt  +  mjn  [0.5  tr(a*VZ2a)  +  V*f(z,  u,  t)  +  L  (z,  u,  t)]  ■  0, 
and  Y(z,  u,  T)  -  0  ,  (2-19) 

where  vt  *  It  »  vz  *  TZ  »  Vzz  *  T2~77~J  • 

Consequently,  a  function  V(.)  can  be  found  satisfying  equation  (2- 
19),  l.e.,  with  corresponding  control  function  u°(t,z),  such  that 

l%£lf(t,  z,  u°)  ♦  L  (t,  z,  u°)  «  {2‘20) 

C|r(t,z)  f  (t,  z,  u)  +  L  (t,  z,  u)], 

then  u°  Is  optimal.  Futhermore,  If  conditions  (1)-(v)  are  satisfied, 
then  (2-17)  has  a  smooth  solution. 

Intuitively,  the  optimal  control  Is  a  function  of  the  value 
function  (the  solution  of  the  stochastic  Bellman  equation)  but  the 
computation  (requires  solving  a  nonlinear  Cauchy-type  problem)  of  the 
exact  value  of  the  optimal  control  Is  In  general  tremendously 
complex.  However,  suggested  abstract  methods  to  solve  a  Cauchy-type 
problem  as  In  (2-17)  will  be  discussed  briefly. 

The  nonlinear  Bellman  partial  differential  equation  (2-19)  can 
be  stated  In  general  as  a  Cauchy  problem  of  the  following  form: 


I 
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Ct  +  H(t,  x,  Ct,  Cxx)  *  0,  (2-21) 

C(0.x)  -  C0, 

where  Ct  denotes  the  partial  derivative  with  respect  to  time,  and  Cx, 

Cxx  the  gradient  and  the  Jacobian  respectively. 

Roughly  speaking,  there  are  three  principle  approaches  for 

solving  the  problem  in  (2-21): 

(1)  Separation  of  variables  (also  called  the  Fourier  method,  or 

solution  by  eigen-function  expansion). 

(2)  Green  function  (also  called  fundamental  singularities,  or 

solution  by  Integral  equation). 

(3)  Variational  formulation  (also  called  the  calculus  of 

variations). 

According!  ,  some  of  the  Important  methods  that  have  been 

discussed  In  the  recent  literature  are: 

(1)  The  parametric  method,  which  Is  developed  by  Friedman  [F2], 

[F3].  Here  a  fundamental  solution  is  first  constructed,  then 
used  to  solve  the  Cauchy  prcnlem. 

(2)  Hllbert-space  method  [Gl].  The  Idea  of  separation  of  variables 
In  the  context  of  Hilbert  space,  and  nunencal  methods  are 
used.  This  Is  a  very  elegant  method  but  unfortunately  It  Is 
extremely  limited  In  Its  scope  of  application. 


i 
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(3)  Function  theoretical  method  [G2],  where  the  theory  of  integral 
operators  is  used.  But  there  is  no  unique  method  to  determine 
the  Integral  operator,  and  hence  a  general  solution. 

(4)  Characteristic  method,  [823 ,  where  the  idea  of  reducing  the 

P.O.E.  problem  to  an  0.0. E.  problem,  through  the  use  of  the 

corresponding  characteristic  function.  Then  a  set  of 
Hamilton's  equations  is  solved  by  simple  integration.  This 
method  guarantees  a  solution  only  where  (t;x)+X  is  invertible, 
that  is,  as  long  as  the  characteristics  do  not  intersect.  But 
any  nonlinearities  In  the  Hamiltonian  lead  to  crossing  of 
characteristics.  Hence,  the  application  to  nonlinear  systems 
is  limited. 

(5)  Transformation  methods  [A3],  where  the  transformation  may  be 

applied  to  either  the  dependent  variable,  the  Independent 
variables,  the  equation  itself,  or  any  combination  of  these. 
However,  the  initial -condition  transformation  may  be  a  real 
problem  In  this  technique.  For  example,  if  the  equation  In  two 
independent  random  variables  tct  +  xcx  *  2txc,  c(0,x)  =  a.  Is 
transformed  by  v  *  tx  Into  the  ODE,  the  single  variable  v  can 
take  on  only  a  single  value  on  any  manifold  tx  *  constant. 

The  choice  of  method  of  solution  to  (2-21)  from  the  above 

suggested  techniques,  depends  largely  on  the  following  factors: 

(1)  degree  of  nonlinearities  in  the  system,  (l.e.  some  of  the 
methods  may  not  be  applicable); 
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(11)  degree  of  difficulties  and  complexity  of  the  method. 

However,  the  transformation  method  seems  to  be  the  most 
promising  one  due  to  Its  simplicity  and  the  existence  of  various 
transformation  techniques  (e.g.,  the  cononlcal  transformation  and  the 
similarity  methods).  For  example.  If  the  system  in  (2-19)  Is  first 
order,  then  the  following  method  transforms  the  system  to  a 
quasi linear  system. 

Ct  *  -H  (t,  x,  Cx),  (2-22) 

C(0,x)  *  f ( x)  . 

Let  P  denote  Cx  and  q  *  P0  *  Ct.  Then  by  differentiation  with 
respect  to  t  and  equating  mixed  partial  derivatives,  the  following 
quasi linear  system  Is  obtained: 

q. 

Pt  *  qx,  (2-23) 

qt  *  -H  (t,  x,  P)  -  Hp  (t,  x,  P)  qx, 

C(0,x)  -  f ( x ) , 

P(0,x)  »  fx(x), 
q  (p,x)  ■  -H  (0,x,fx(x)) 

That  Is  If  C  Is  a  twice  continuously  differentiable  solution  of 
the  Bellman  equation,  then  (C,  P,  q)  Is  a  solution  of  (2-22),  and  the 
converse  can  be  proven  to  be  true  also. 
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2-6  An  Optlaal  Finite-Olnenslonal  Filtering  Algorithm 

The  general  filtering  algorithm  consists  of  the  following  steps: 

Step  1  -  Approximate  the  given  nonlinear  system,  (l.e.  the 
system  In  (2-2),  by  a  bilinear  feedback  model  that  has  the  form  In 
(2-6),  where  the  parameters  are  linear  nonantlcipatlve  functions/ 
functionals  of  the  control  ut,  (which  Is  measurable  with  respect  to 
the  cr-algebra  Yt  *  (ys;  s  ^  t}. 

Step  2  -  Choose  the  performance  index  for  the  global  filtering 
criterion  as  in  (2-5),  which  Is  also  a  measure  of  the  quality  of  the 
model  approximation  in  (2-6).  If  ut  which  minimizes  (2-5)  Is 
obtained,  then  (2-6)  Is  a  "close"  model  approximation  to  (2-2). 

Step  3  -  If  the  parameters  (A,  B,  C,  D,  G,  R)  satisfy  the 
conditions  In  (2-7)-(2-10),  then  the  correspondl ng  finite-dimensional 
filter  has  the  form  as  In  (2-12),  with  Its  parameters  as  functions  or 
functionals  of  u^  . 

Step  4  -  Transform  the  performance  index  defined  In  Step  2  to  an 
equivalent  criterion  as  In  (2-15)  using  (2-13),  and  (2-14). 

Step  5  -  The  system  In  (2-15),  and  (2-17)  forms  a  completely 
observable  stochastic  control  problem,  which  should  be  solved  by 
classical  dynamic  programming  to  find  the  optimal  control  ut  . 

Step  6  -  Once  the  optimal  control  ut  Is  obtained  from  Step  5 
and  then  substituted  back  Into  both  (2-6)  and  (2-12)  a  model 
approximation  as  well  as  a  finite-dimensional  nonlinear-filter 
approximation  will  be  generated. 
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Remarks: 

(I)  The  assunptlon  that  the  solution  of  the  control  problem  In  Step 
5  Is  close  to  u  ,  which  minimizes  the  performance  criterion 
defined  In  Step  2,  resembles  the  assunptlon  In  the  EKF 
approach,  and  closes  the  approximation  procedures. 

(II)  In  general,  the  stochastic-control  problem  In  Step  6  results  In 
the  nonlinear  Bellman  partial -differential  equation  as  In  (2- 
19).  Hence,  It  may  be  no  simplification  as  compared  to  the 
original  problem  of  finding  a  filter  for  (2-2).  This  Is  an 
Indication  of  a  possible  problem  one  will  encounter  In  using 
this  approach  If  the  exact  value  of  the  controls  are  required. 

However,  as  was  emphasized  In  [K2],  the  structure  of  the 
optimal  control  suggested  by  an  approximate  solution  to  the 
Bellman  equation,  which  results  In  an  approximate  control  that 
yields  a  better  performance  filter  than  the  EKF. 

The  following  example  [K2]  Illustrates  the  previous 
filtering  algorithm  steps: 

Let  dx^  *  a  lx  I  dt  +  dw^, 

(2-24) 

dy^  »  xtdt  +  dwg. 
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The  bilinear  feedback  approximation  model  Is 

dxt  «  ut  x^dt  +  dw^,  (2-25) 

dyt»  x^dt  +  d)«2 ^  ts[0,T], 
where  ut  Is  chosen  to  minimize 

t  , 

Q(u)  ■  mjn  E(^  (a  |x|  -  u  xtrdt  .  (2-26) 

From  (2-12),  the  optimal  (m.s.e)-fllter  equation  has  the  following 
form: 

dxt  *  utxtdt  +  (2-27) 

drt  ■  (20  rt  +  l  -  rt2)  dt, 

where  dvt  *  -  xtdt  ,  Is  the  Innovation  process,  having  relevant 

properties  of  the  Wiener  process.  Now  from  (2-13),  (2-14)  the 
equivalent  minimization  criterion  to  (2-26)  Is 

T  - 

Q  (t,  x,  t)  «  min  E  /  (a  Id  -  uc)z  d*(e),  (2-28) 

0 

where  d*(c)  Is  as  defined  In  (2-14). 
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Using  the  dynamic-programming  approach  for  the  stochastic 
control  system  defined  by  (2-27),  (2-28)  one  can  show  [02]  that  the 
optimal  u  Is  given  by 


2Xr 


,=2 


u  -  a  [  exp  (-C-.)  -  2(x  +  r)  erf  (- - ) 

rsr  ^  /w 


(2-29) 


-  (r|f  +  o.s  x  il)]  (x2+  r)'1, 

3x 


where  erf(o)  *  /  -~e'0,5^  . 

0  SIS 


Here,  V  Is  the  solution  to  the  Bellman  equation 


.2  u  2 


|f  +  0.5  r2  +  |f  (l  -  r2)  +  (x2+  r)  (a‘  -  ^ )  -  o, 

3  X  9 


v  (T,  X,  r)  -  o. 


(2-30) 


The  above  partial  differential  equation  Is  very  difficult  to 
solve  analytically,  and  It  Is  a  good  Indication  of  the  possible 
problem  one  may  face  while  attempting  to  solve  the  stochastic  problem 
defined  In  Step  6  of  the  algorithm.  However,  an  approximate  solution 
to  (2-30)  of  the  form 


V  (x,  r,  t)  •  p(t)  exp  (-  yi-)  ,  p(T)  ■  0, 


(2-31) 
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when  substituted  Into  (2-29)  yields  the  approximate  control 


2x/f  exp(-  Ip)  -  2  erf  (  -  -£-) 

u  -  a( - - - 2.  /2F  , 

rsr  (x*  +  r) 


which  as  shown  In  [K2]  gives  better  performance  than  the  EKF. 


(2-32) 


Figure  2-1 

Schematic  Diagram  for  the  Proposed  Conditional ly-Gaussian  Filter 
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3.  APPROXIMATE  -  FILTER  STRUCTURE 

In  this  chapter  a  new  filtering  structure  for  approximating  a 
certain  class  of  large-scale  nonlinear  filtering  problems  Is 

presented.  A  major  feature,  l.e.,  a  decomposition  scheme  Is 
Incorporated  Into  the  filter  design.  The  motives  to  develope  this 
scheme  are:  one,  to  resolve  the  "curse1*  of  dimensionality 

encountered  If  an  optimal  approximations  Is  sought,  due  to  the 

requirement  of  solving  a  control  problem  which  Is  a  function  of 

)  variables.  In  general,  this  is  a  very  difficult 

problem  to  solve.  Two,  to  alleviate  the  difficulties  with  respect  to 
control  policy  definition  and  calculation,  which  arise  If  the  (THF) 
Is  applied  directly  and  the  global  properties  are  assumed,  but  an 
approximate  control  Is  sought.  Finally,  using  this  scheme  will 

reduce  the  complexity  of  the  algorithm,  and  results  In  significant 
computer  saving  In  the  digital  simulation. 

The  strategy  adapted  In  this  decomposition  scheme  Is  based  on 
the  decomposition  of  the  system  into  two  inter-connected 
subsystems.  The  definition  of  the  subsystem  can  be  Imposed  using 
purely  physical  reasoning.  The  first  subsystem  is  only  linear  and  It 

will  act  as  a  first  stage  of  the  filtering  process.  The  second 

subsystem  Includes  all  the  nonl Inearl ties  In  the  system,  which  are 
then  approximated  by  the  proposed  two-step  nonlinear  filter  (TNF). 
This  scheme  has  the  following  advantages: 
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(I)  only  the  linear  filtering  problem  Is  solved  In  the  first  level; 

(II)  the  control  parameter  which  Is  needed  In  the  TNF  algorithm, 
will  be  easy  to  obtain  as  a  function  of  the  parameters  of  the 
first  stage  linear  filter.  So,  no  difficulties  with  respect  to 
measure  theory  or  control  policy  will  be  encountered  If  the 
global  filtering  criterion  Is  retained; 

(III)  In  many  cases,  there  Is  a  substantial  computational  saving  as 
compared  to  the  global  single  system  solution. 

An  Interesting  class  of  nonlinear  system  has  the  following  form 


I 


dxt  ■  F(x,y,t)dt  +  G(x,y,t)dw, 

(3-1) 

dyt  *  H(x,y,t)dt  +  o(t)dv  , 

where  w,v  are  nutually -Independent  vector  Wiener  processes  of 
appropriate  dimensions;  o(t)  Is  a  matrix  of  compatible  order.  The 
functions  F(.),  H( . ) ,  and  G(.)  can  be  partitioned  as  follows: 


F(x,y,t)  ■  fj(t)  x  +  f2(x,y,t)  , 


H(x,y,t) 


^hx(t)x  ' 
^(x.y.y^ 


(3-2) 


i 


s 
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G(x,y,t)  -  gx( t)  +  g2(x,y,t)  . 

Here,  fj,  hjt  and  gj  are  linear  matrices  of  appropriate  dimensions 
and  f2,  h2,  and  g2  are  nonlinear  functions  of  their  arguments,  and  of 
compatible  orders. 

The  general  outline  of  the  decomposition  scheme  and  filtering 
algorithm  are:  (1)  the  given  n-dlraenslonal  system  as  In  (3-1)  Is 
decomposed  Into  two  subsystems.  Subsystem  I  consists  of  a  linear 
system,  (l.e.  linear  dynamic  and  observation  equations).  Subsystem 
II  Is  a  nonlinear  system,  which  contains  all  the  nonl 1 neari tl es  of 
the  original  system  and  Is  approximated  by  the  proposed  model  space 
approximation. 

(2)  Apply  a  classical  filtering  technique,  l.e.  the  Kalman-Bucy 
algorithm  [02]  to  the  linear  system  In  Subsystem  I.  This  will  be 
considered  as  the  first  stage  of  the  filtering  algorithm. 

(3)  Find  an  appropriate  bilinear  approximation  model  to  the 
nonlinear  system  In  Subsystem  II. 

(4)  Finally,  from  Subsystem  I  and  the  bilinear  approximation  of 
Subsystem  II,  form  a  new  system.  This  new  system  will  be  of  the  form 
that  has  a  finite-dimensional,  conditionally -Gaussian  filter  as  In 
(2-12)  If  certain  assumptions  (2-7)-(2-l0)  (See  Chapter  2  Section  (2- 
3).)  about  the  system  parameters  are  satisfied. 
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The  block  diagram  representation  of  the  decomposition  scheme  and 
the  filtering  algorithm  Is  given  In  Figure  (3-1),  while  a  detail 
schematic  representation  Is  given  In  Figure  (3-2). 

To  summarize,  the  various  algorithmic  steps  are: 

Step  1.  The  given  nonlinear  system  as  in  (3-1)  can  be 
decomposed  into  two  subsystems. 

Subsystem  I 


dXft*  fl^^  x  dt  +  g]_(t)dw  , 

(3-3) 

dyft»  hj(t)  x  dt  +  c^t)  dVj  , 

where  flf  hj,  and  g^  are  as  defined  before,  and  w,  v^  are  Independent 
Wiener  processes  of  appropriate  dimensions.  The  subscript  f  denotes 
the  first  subsystem. 


Subsystem  II 


dx$t-  f2(x,ytt)dt  +  92<x»y.t)clw  . 

dyst“  h2(x»y,t)dt  +  a2dv2  * 


(3-4) 


where  the  subscript  s  denotes  the  second  subsystem,  f2(.),  h2(.),  and 
g2(.)  are  generally  nonlinear  functions  of  their  arguments,  and  w,  v2 
are  Independent  Wiener  processes  of  compatible  orders. 
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Step  2  The  Kalman-8ucy  filter  equations  for  the  system  in  (3-3) 

are: 

dXft  *  fj*fdt  +  phj*(ffjOj)"*dnt  ,  (3-5) 

dpt  *  ^ip  +  pfi*  +  9l9i*  “  dt  » 

dnt  *  dyft  -  hi  dt  , 

Xf(0)  »  E(xf{0))  ,  p(0)  *  cov  (xf(0))  , 


where  xf  is  the  estimate,  and  p(t)  Is  the  error  covariance  matrix. 
Here  dnt  Is  the  Innovation  process. 

Step  3  The  bilinear  approximation  model  for  the  system  in  (3-4) 
has  the  following  form: 


dx$  -  f2  (t,  ut,x)  dt  +  (t,ut)  dw 


dy5  -  h2  (t,  ut,  x)  dt  +  <*2dv2  , 


where 


f2  (x,  ut,  t)  »^ut(t)  Xj(t)  +  un+1(t) 


(3-6) 


A(t,u)  x  +  B(t,u)  , 
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h?(x,ut,t)  ■  £  u, 

c  J-l  J 


(t)  x. 


Vi(t) 


*  C(t,u)  x  +  D  (t,u)  , 


9  (t,ut)  *  gQ  (t,  ut)  . 


(3-7) 


Here  n,  the  dimension  of  the  system,  and  the  second  equality,  is  used 
for  mathematical  convenience.  The  control  u-j(t),  1  *  1  *  j  *  1,2— 
n+1  are  measurable  with  respect  to  cr-algebra  {y^seCO.t]} ,  and  are 
chosen  to  minimize  the  following  global  filtering  criterion: 


Q(u) 


(k-k)2dt. 


(3-8) 


Using  the  property  of  expectation  and  Bayes  formula,  (3-8)  becomes 


Q( 


u)  *  gin  E  (/  E(k-k)2/yf$;  s  ><  t)  dt  ) 


(3-9) 


■  min  E  (/  H(x,u)  dt 
H1  0 


where  E*  denotes  the  conditional  expectation.  Here  again,  k  denotes 
any  of  the  functions  fg,  h2»  or  gg,  while  I?  denotes  the  corresponding 
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approximation  f2,  h2,  or  §2  In  (3-7),  and  H(x,u)  *  Et(k-lc}2.  The 
arguments  (t,y,x),  (t,  ut)  are  omitted  for  brevity. 

The  minimization  of  (3-9)  with  respect  to  u-j,  1*1,2, ...n+1,  can 
be  performed  since  the  expectation  is  conditioned  on  the  o-algebra 
(yfs;  se[0,T],  which  is  not  a  function  of  the  parameters  u-j,  1  * 
1,2... n+1.  Moreover,  the  parameters  u^,  1*1, 2... n+1  are  functions  of 
the  states  x^,  1*1, 2... n  but  the  states  x^,  1*1, 2... n  are  not 
function  of  the  parameters  u-j,  1*1, 2... n+1.  In  fact,  the  problem  can 
be  treated  as  a  special  case  of  an  open-loop  control  problem  formed 
by  dynamics  (3-6)  and  cost  (3-9),  because  as  far  as  the  newly  formed 
problem  Is  concerned  the  parameters  u^,  1*1, 2... n+1  are  functions  of 
t,te[0,t].  According  to  the  argument  given  in  Chapter  2-Section  (2- 
5)  and  [02],  u^  can  be  found  by  solving  the  following  equation: 


Vt+  O.Str  (oV  a*)  +  min  [V„f(x)  +  H(x,u)]  *  0,  1  *  1,2,... n+1, 

*  xx  *  (3-10) 

where  f(.)  denotes  any  of  the  functions  f ^  or  g2»  and  Vt,  Vx, 

i 

are  as  defined  In  (2-19).  But  as  pointed  out  earlier,  f(x)  are  not 
functions  of  uj  1*1, 2, ...n+1. 


thus 

9  Urn 


0  , 


and  (3-10)  can  be  written  equivalently  as 


46 


gin  [H(x,n)3  -  0,  1*1,2, ...n+1  .  (3-11) 

Now,  to  obtain  the  parameters  u-j ,  1*1,2,  ...n+1 

which  minimizes  (3-9),  a  set  of  equations  resulting  from  (3-11)  must 
be  solved  simultaneously  and  as  pointed  out  previously,  they  are 
functions  of  (xf,  p). 

Step  4  The  new  equivalent  system  has  the  following  form: 

dxt»  (A,(t,£L)  xt  +  B,(t,ut))  dt  +  G(t,uJ  dw, 

1  1  1  (3-12) 

dyt  *  (C1(t,ut)  xt+  D1(t,ut))  dt  +  odv, 

where 

A^t.u^  *  fx(t)  +  A(t,ut),  Bjtt.u^  *  B(t,ut), 


f (t)> 

,  Dt(t,ut)  * 

(  °  1 

<C(t,ut)> 

1 

^D(t,ut)y 

1 

ox(t) 

G1(t,ut)  *  gj(t)  +  gQ(t,  ut),  a  ■ 


» 


0  ^ 
°2(t>. 
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Here  again  the  matrices  are  of  compatible  orders. 

Step  5  With  certain  assunptlons  about  ( A j,  Bj,  C]_,  Dj.Gi,  o) 
and  the  distribution  of  the  Initial  state  Xq  given  y0  (see  Chapter  2 
Sections  2-3),  the  corresponding  conditlonally-Gausslan  filter  Is  of 
the  following  form: 

dxt  *  (AjXt  +  Bj)  dt  +  S  dv  , 

S  -  (rt  C*)  (co*)-°-5  , 

dv  *  («o*r°‘5(dyt  -  CCjX^  +  Oj  dt])  , 

drt  *  ( Ax  rt  +  rt  Aj*  +  GjGj  -  SS*)  dt  , 

where  A^,  Bj,  C^,  0^  are  as  In  (3-12),  and  the  arguments  (t,ut)  are 
again  omitted  for  brevity.  A  second-order  example  to  demonstrate  the 
above  algorithm  steps  will  be  given  In  the  following  chapter. 


(3-13) 


Itoaarks: 

(1)  The  controls  u-j  l»1»j«l,2,...n+l  are  suboptlmal,  since  they 
are  functions  of  the  observations  In  the  first  stage.  If  all 
of  the  observations  are  used  In  the  first  stage  (l.e.  the 
observations  and  the  system  are  linear),  then  the  controls 
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u-j  ,  1  *  l,2,...n+l  are  optimal  (In  m.s.e). 

(II)  In  using  this  algorithm,  the  control  parameters  are  obtained 
without  solving  the  stochastic  Bellman  equation.  However,  It 
Is  still  necessary  to  evaluate  the  conditional -expectation 
expressions  In  (3-9). 

(III)  In  general,  the  state  estimate  obtained  In  Step  5  Is  an 
improvement  relative  to  that  obtained  in  the  first  stage,  and 
that  due  to  the  use  of  all  the  Information  in  the  last  step. 

( 1  v)  If  a  “local -time"  filtering  criterion  Is  assumed,  then,  as  has 
been  shown  by  Kolodzlej  and  Mohler  In  [K3],  there  Is  no 
control -law  calculation  since  the  approximation  parameters  are 
not  functions  of  ut.  But,  the  evaluation  of  similar 
conditional -expectation  expressions  are  still  required. 


Figure  3-2 

Schematic  Representation  for  the  TNF 
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Chapter  4 


4 
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4.  TNF  APPLICATIONS  AND 
SIMULATIONS  STUDIES 


The  Implementation  of  the  proposed  two-step,  nonlinear  filter 
(TNF)  are  demonstrated  via  applications  to  a  certain  class  of 
nonlinear  system.  In  [K2]  the  applicability  and  effectiveness  of  the 
filter  have  been  demonstrated  for  a  nonlinear  system  that  lacks  the 
usually  necessary  model  "smoothness".  Obviously,  the  TNF  Is  also 
applicable  to  "smooth"  models  as  demonstrated  herein. 

The  examples  that  are  treated  here  were  chosen  from  practical 
applications  to  Illustrate  the  proposed  procedures. 

The  claims  made  for  Improved  performance,  are  verified  through 
computer  simulation  results  In  the  following  sections. 

4-1.  General  Simulation  Comments 

The  digital-simulation  examples  that  are  discussed  in  the 
succeeding  sections,  were  coded  In  FORTRAN  In  such  a  manner  that  the 
program  generates  solutions  to  the  states  of  the  original  system,  and 
the  estimated  values  of  the  states  by  both  the  TNF  and  the  EKF. 
Moreover,  throughout  all  the  simulation  cases,  the  Wiener  processes 
w^  which  describe  the  excitation  noises  are  generated  from  pseudo¬ 
random  Gaussian  variables  vj  N(0,1).  The  latter  generated  by  a 
standard  (IMSL)  Library  Subroutine.  And,  Increments  of  wt  are 
approximated  by  dw  ■  AT  ,  where  at  Is  the  Integration  step- 


size  (.001  to  .0001).  In  the  simulation,  a  fourth-order,  Runge- 
Kutta  Integration  algorithm  Is  used  for  all  trajectory  filters  and 
differential  equations  of  both  the  original  system  and  the  error- 
covariance  matrices. 

The  performance  of  the  two  filters  are  compared  on  the  basis  of 
the  "mean-square  error"  (m.s.e)  of  the  filter  output  to  xt  such  that: 

JTHF  *  £  “  *TNF(t,)2dt  • 

(4-1) 

T  o 

dEKF  *  l  “  *EKF^)  * 


and  JJ  gives  the  relative  (percentage)  difference  between  J-p^tt), 
JEKF(t)  such  that 


JJ 


'EKF 


(t) 


TMF 


(t) 


EKF 


x  100  . 


(4-2) 


Actually,  nwerous  simulation  tests  were  conducted  during  these 
simulation  studies,  however,  only  a  small  representative  sampling  of 
the  results  have  been  presented  here.  Nevertheless,  these  results, 
together  with  the  ones  presented  In  [K2],  clearly  demonstrate  the 
effectiveness  of  the  proposed  filter.  It  should  be  noted  that 
throughout  the  simulation  process,  the  following  Identification 
symbols  have  been  used  to  Identify  the  different  plots  In  the 
succeeding  Figures  of  the  sequel:  the  "o"  (solid  line)  Is  used  to 
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identify  the  state  trajectory  of  the  original  system,  “v"  (dashed 
line)  to  mark  the  output  of  the  TNF,  while  "x"  (dashed  line)  is 
reserved  for  the  output  of  the  EKF. 

4-2  Exa^le  1 

An  Important  underwater  application  Is  the  sonar  tracking  of  a 
moving  rigid  body.  This  could  be  an  active  tracking  of  multi-mode 
range  system  or  passive  tracking  with  multi -receiver- transmitter  and 
correlated  time  delay.  The  rigid  body  considered  here  Is  a  point 
mass. 


4-2 -a  Problem  Definition  and  Model  Formulation 

The  problem  at  hand  Is  to  develop  an  optimal,  (or  at  least 
suboptlmal),  nonlinear  finite-dimensional  estimation  algorithm  for 
the  range  and  range  rate  of  a  rigid  body  based  on  the  noisy 
observation  of  Its  position  and  velocity  provided  by  a  sonar  signal. 

The  state  vector,  (Range  ■  Xj^,  Range  Rate  *  x2),  evolves  accord¬ 
ing  to  the  following  stochastic  differential  equation  [PI]: 


dx  ■  A  x  dt  +  Gdw  , 


(4-3) 


fd*ll 

fo  1> 

fo\ 

,  A  « 

,  G  - 

<0  -  aj 

l  «  ) 

«  *  1/t  , 


where,  dx 


w  is  a  Wiener  process.  Here  Is  Influenced  randomly  by  target 
maneuver  which  Is  characterized  by  t  the  maneuver  time  constant. 

The  measurement  equations  are  nonlinear  due  to  the  "acoustic 
propagation"  time  delay,  when  tracking  Is  derived  directly  in  current 
time.  The  observation  equations  are 

dy  a  H  (x,t)dt  +  Rd  W2  ,  (4-4) 


where  dy  * 

H 

,  H  (x.t)  ■ 

\  *  ViV 

,  R  - 

o'' 

v.  x2  . 

°2 

W2  Is  a  Wiener  vector  of  measurement  noises,  Bj  *  a/c,  a  Is  a 
constant,  and  c  Is  the  average  speed  of  sound  In  water. 

4-2 -b  Filtering  A1  gorl the 

To  find  an  approximate  finite-dimensional  filter  for  the 


nonlinear  system  In  (4-3),  (4-4),  the  decomposition  scheme  and  the 
corresponding  filtering  algorithm,  discussed  In  the  previous  chapter, 
are  utilized. 
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Subsystem  I 

Here  only  the  range  rate  Information  Is  used.  Thus,  the  linear 
system  and  observation  equations  are: 


dxj  *  X2dt  , 

dx£  *  *  o  X2  +  Cj  s  (1kj  , 

(4-5) 

<ty2  *  X2  dt  +  02  d»2  » 


where  c1  Is  a  multiplicative  constant.  Using  the  Kalman-Bucy  method, 
the  estimate  ,  1  ■  1,2  satisfies  the  following  linear 
stochastic  equations: 


p 

d*lf  ■  hi#  *  oj  dv 
P2 

dx2f B  "aX2fdt  +  aT  dv* 


t 


where  «  Etxl/yg) .  1  *  1,  2,  the  conditional  expectlon,  and  dv 
Is  the  Innovation  process.  The  corresponding  error-covariance 
equations  are: 


•  (2P3  -  P32/  o22)  dt, 


(4-6) 
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P  2 

dP2  +  (-2oP2  +  CjV  -  -Ij)  dt  , 

°2 


dP, 


(P«  -  oP- 


P2P3 


.)  dt 


Subsystem  II 

Let  XjX2  ■  (u^  +  UgXg  +  u3)  .  (4-7) 

Here  u1(  1  ■  1,2,3  are  Measurable  with  respect  to  the  o-algebra  (y£s * 
sc[0,t]),  and  chosen  to  nlnlalze  the  following  global  criterion: 

T  , 

J(u)  ■  «Jn  E  (  £  (XjX2  -  (UjXj  ♦  u2  x2  +  u3)r)dt.  (4-8) 


Equation  (4-8)  can  be  written  equivalently  using  the  properties  of 
expectation  as 


J(u)  •  «Jn  E  C  £  E  (XjXg  -  (UjXj  ♦  UgXg  +  u3))2/y2s;s«C0,T])  dt] 


T  t 

C/  2t(x. 


V  -  R  ♦  KZ) 


dt] 


(4-9) 


where  K  ■  ujXj  ♦  113X2  +  03X3 
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R  *  2  x^XgK  . 

Let  H(x,u)  -  Et(x12x22)  -  Et(R)  +  E^K2)  . 


(4-10) 


Then  (4-9)  becomes 


J(u) 


min 

U1 


E  (  /  H(x,u)  dt) 
0 


(4-11) 


Here,  the  minimization  In  (4-41)  with  respect  to  u^,  1  *  1,2,3  can  be 
performed  since  the  expectation  Is  conditioned  on  the  o-algebra  (y2s; 
se[0,t])  which  Is  not  a  function  of  the  parameters  u^,  1  *  1,2,3. 
And  the  fact  that  the  unobservable  states  x1(  1  *  1,2  are  not 
functions  of  the  control  parameters  u^,  1  *  1,2,3.  Accordingly, 
3Et(x.2x22) 

- - *  o  for  1  *  1,2,3  .  Mow,  using  similar  arguments  as  in 

Chapter  3,  the  minimization  of  (4-11)  requires  that 


Vt+  0.5  tr  (o  Yxx  o*)  +  mjn  [Vx  x^t  H(x,u)]  *  0  for  1*1, 2, 3  . 

1  (4-12) 


Thus,  performing  the  minimization  with  respect  to  u^,  1*1, 2,3  In  (4- 
12),  the  following  set  of  equations  (which  must  be  solved  simultan¬ 
eously)  are  obtained: 
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-  2  Et(x12x2)  +  2Sj  E*  (Xj2)  +  2u3  Et(x1)  +  2u2Et(x1x2)  •  0, 

-  2  Et(Xg2x1)  +  2  fl2  t^x22)  +  2  Q3  Zl{x2)  +  2  ffjE^x^)  -  0  , 

(*-13) 

-2  Et(x1x2)  +  2  u3  +  2  Uj  Et(x1)  +  2  u^fXg)  -  0  . 

To  calculate  Uj»  u2»  u3,  which  minimize  (4-9)  or  (4-11),  from  (4- 
13)  the  conditional  expectation  expressions  must  be  evaluated 
first.  From  the  basic  definition, 

Et(xi2)  -  Pi  +  Xj2,  1  *  1,2  .  (4-14) 

To  evaluate  the  expressions  Et(xi2Xj)  ,  Et(x1Xj2)  , 

Et(xixj)  ,  1,j  ■  1,2  ,  the  results  of  Lemmas  (Bl,  B2)  from  Appendix 
B  are  utilized.  Thus, 

Et(xi%>  *  2Vu  ♦  «j  “h  • 

(4-15) 

E*  (x,*,2)  ■  2Sj  “ij  *  *,  “jj  . 


Et(x 


1xj) 


Vi  +  P3 
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Then, 


Hence 


Then, 


where 


from  (4-14),  (4-15)  In  (4-13), 

fll  *  hf  * 
h  *  xlf  ' 

*3  *  P3  ’  XlfX2f  * 

X1X2  *  x2fxl  +  x2fx2  +  P3  ’  xlfx2f  * 


the  new  equivalent  system  Is 


dx  ■  A  x  dt  +  GdWj^  , 


dy  »  H  x  dt  +  D(x)  dt  +  Rdw2  , 


H  » 


Bjfl  + 


D(x) 


B1  S3 


1 


A,  G  are  the  same  as  In  (4-3). 


(4-16) 


(4-17) 


(4— 18> 
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Now,  assume  the  following: 

«»  m 

(i)  if  f  denotes  any  of  the  functions  A,  G,  H,  D,  R,  then 

P  (  /T  |  f  I  dt  <  -  )  -  1; 

0 

(1i)  Xg  given  yg  is  conditionally  Gaussian.  Then,  from  (2-12)  the 
corresponding  conditionally-Gaussian  filter  Is 

dXj  *  Xgdt  +  -l?  (Tj  (1  +  BjUj)  +  Bjiig  r3) 

®1 

dx2  »  -ax2  dt  +  [J^.  (r3(l  +  B1u1)  +  BjU^)] 

°1 

where, 


r3 

dVj  +  2  dv2  , 


dv^+ 


°2 

r. 


(4-19) 


dv, 


2  ’ 


dVj  «  dyj  -  (1  +  “x®  1*1  +  Bi“2*2  +  Bl“3^  ’ 


dv2  *  *^2  "  *2  * 


and  B},a  are  defined  as  before. 
The  covariance  equations  are 
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d  r3  »  {r2-  or3-  (r^l+B^)  +  r38^u2)  (r3(l+B1u1) 


(4-20) 


+  r2BlQ2)]  '  “T  (r2r3)}  dt* 
°2 


where  u2,  S3  are  as  In  (4-16). 

Notice  that  In  this  case  (4-19),  (4-20)  are  the  same  as  the  filter 
equations  of  the  modlfled-second-order  truncated  filter  defined  by 
Jazwlnskl  [JI]  because  the  nonlinearity  is  of  second  order,  although 
the  approach  Is  different. 


4-2 -c  Extended  Kalman  Filter 

The  filter  equations  are  C«J1]: 

d  Xj  «  *2dt  +  -kjr  ZMfa)  Pi  +  P3Bi*i^  dvi 

ffl 


(4-21) 
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dX2  ■  -o  Xtdt  +  — ig.  C(1  +  B^)  P3  +  P2B1X1)]  dVj 
ffl 


+  P 


2 


9 


where  Bj.a  are  as  defined  before,  and 
dvl  *  (dyl"  11  +  BiV  *i>» 

dv2  "  (<^2  ‘  V  * 


The  covariance  equations  are: 


/ 


dP3  -  {P2  -  oP3  -  C— (l+e1x2)P1+  x1b1p3)  ((1  +  Bxx2) 
°1 

P3  ♦  BjFjIj)  -  -i*  P2P3}  dt  * 
a2 


:  - 
t  J 
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4-2 -d  SI  aul  at  Ion  Results 

Here,  the  previous  sonar  problem  Is  simulated,  where  both 
equations  (4-3),  (4-4),  and  the  TNF,  EKF  equations,  (4-19),  (4-20), 
(4-21),  (4-22),  were  solved  by  the  digital  computer.  Numerous 
simulation  tests  were  conducted  for  this  problem  using  different 
Initial  conditions  and  parameter  values.  But  only  a  representative 
sampling  of  the  results  have  been  presented  here.  These  simulation 
results  were  tabulated  In  Tables  (1,2),  where  represents  the 
percentage  position  (range)  error  (In  m.s.e)  accuracy  of  the  TNF  as 
coopered  to  the  EKF,  while  JJ2  represents  the  percentage  velocity 
(range  rate)  error  (In  m.s.e)  accuracy  of  the  TNF  as  compared  to  the 
EKF  range-rate  output.  In  all  cases,  0Jlf  JJ2  were  calculated 
according  to  the  equations  In  (4-1).  In  figures  (4-9)  -  (4-16)  the 
range  root-mean-square  (rms)  error  Qj(t),  Q2(t),  and  the  rms  velocity 
error  VQ^t),  VQ2(t),  for  both  the  TNF,  the  EKF  respectively,  were 
calculated  as  follows: 


jl  -  sJmiv)*  5 

I  - R - 


(4.23) 


(4-24) 
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where  (xj^(tk),  x^^{tk))  are  the  jth  components  of  the  true 
state  and  Its  corresponding  TNF,  EKF  estimates  at  time  t^  on  the  1th 
simulation  run.  In  a  series  of  IL  runs.  For  completeness,  some 
comments  on  the  filter  Initialization  seem  In  order  here.  Under 
actual  operating  conditions  It  is  extremely  difficult,  and  Indeed 
rare  due  to  one  reason  or  another,  to  obtain  reliable  Initial 
estimates  of  the  state  vector  and  Its  associated  covariance  matrix. 
Consequently,  the  following  set  of  Initial  conditions  are 
realistically  chosen.  Throughout,  the  Initial  range  value  Is  5000 
meters,  while  the  Initial  range  rate  value  Is  assured  constant  and 
chosen  from  the  following  set.  (50m/sec,  500  m/sec,  1000  m/sec).  The 
Initial  condition  of  the  estimates  are  calculated  according  to  the 
following  equation: 

Xj  (0)  »  Xj(0)  +/!Tft0Tn1  ,  1*1,2  (4-25) 

where  nj  Is  a  random  noise.  The  Initial  covariance  matrix  is: 


P(0) 


Pj(0) 

P3(0) 

106 

i — 
CM 

O 

*■4 

P3(0) 

Pz(0) 

a 

102 

104 

L  J 

9 


where  the  diagonal  elements  of  P(0)  are  chosen  relatively  large  so 
that  the  filter  will  "forget*  the  Initial  values  as  more  data 
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arrived,  and  to  ensure  the  randomness  of  the  Intlal  estimates.  In 
all  cases,  a  system  noise  of  1%  of  the  Initial  state  values  Is  used, 
and  different  levels  of  measurement  noises  (from  2%  -  20%)  of  the 
Initial  range,  range  rate  respectively,  are  added.  For  convenience, 
the  time  Interval  for  each  run  Is  10  seconds,  and  the  nunber  of  runs 
for  each  simulation  test  case  Is  between  10  to  20  runs.  Thus,  all 
results  have  been  ensemble  average  over  IL  runs;  [the  nuaber  of  runs 
for  each  simulation  test  (10  to  20)]. 

The  effect  of  Increasing  the  nonlinearity,  (l.e.  Increases  a), 
of  the  system  on  the  filter  trajectories  and  the  rms  error  levels  are 
demonstrated  In  Figures  (4-2),  (4-6),  (4-10),  and  (4-14)  as  compared 
to  Figures  (4-1),  (4-5),  (4-9),  and  (4-13)  respectively.  Accord¬ 
ingly,  the  TNF  performance  Improved  substantially,  and  the  rms -error 
levels  Increased  enormously  as  compared  to  the  rms-error  levels  of 
TNF.  The  above  conclusions  are  also  demonstrated  by  Table  (1). 

In  coppering  Figures  (4-3),  (4-7),  (4-11),  and  (4-15)  with 
Figures  (4-2),  (4-6),  (4-10),  and  (4-14),  It  Is  noticed  that  an 
Increase  In  the  velocity  measurement  noise  standard  deviation  <*2  by 
10%  will  degrade  the  performance  of  the  TNF,  and  Improve  the 
performance  of  the  EKF,  while  the  rms-error  levels  significantly 
decrease.  However,  the  TNF  still  performs  better  than  the  EKF. 

Comparison  of  Figures  (4-4),  (4-8),  (4-12),  and  (4-16)  with 
Figures  (4-1),  (4-5),  (4-9),  and  (4-13)  respectively,  and  Table  (2) 
Indicates  that  the  EKF  gains  In  accuracy  relative  to  the  TNF  as  the 
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observations  become  more  noisy  (l.e.  Increases  the  range  measurement 
noise  standard  deviation,  to  20%).  This  Is  due  to  the  fact  that 
the  nonlinearity,  (here  In  the  range  measurement  equation).  Is  masked 
by  the  large  measurement  noise.  In  essence,  this  Is  In  total 
agreement  with  the  remarks  pointed  out  by  Jazwlnskl  [Jl]  In  his 
criticism  of  Schwartz's  simulations  [S2]. 

Finally,  It  Is  generally  noted  that  the  mean-square  errors 
Qj2(t),  VQj2(t) ,  Q22( t) ,  VQ22(t)  of  the  estimators  and  the  optimal 
error  covariance  I^ft),  r2(t),  and  P^(t),  P2(t)  of  the  TNF,  EKF 
respectively  are  not  the  same.  That  is  due  to  the  fact  that 
averaging  over  (IL)  samples  paths  (10-20)  does  not  give  a  good 
approximation  to  the  expectation. 

From  the  tables  and  figures  mentioned  above,  the  following 
remarks  seem  In  order: 

(I)  In  almost  all  the  cases,  (except  Table  2),  the  TNF  shows 
significant  Improvement  In  filter  accuracy  as  compared  to  the 
EKF. 

(II)  In  general,  the  TNF  range  rms  error,  and  velocity  rras  error  are 
much  smaller  than  correspondl ng  EKF  as  shown  In  Figures  (4-13) 
-  (4-16).  This  clearly  demonstrates  the  effectiveness  of  TNF 
over  the  EKF. 


(Ill)  The  computer  utilization  cost  of  the  TNF  Is  around  10%  higher 
than  the  cost  of  the  EKF,  while  the  storage  requirements  are 
relatively  equal.  Thus,  the  computer  cost  consideration  will 
not  be  a  preference  factor  as  far  as  this  application  Is 
concerned. 

(1v)  Finally,  the  complexity  of  the  proposed  algorithm  (TNF)  over 
the  EKF  might  be  justified  by  the  significant  Improvement  In 
the  filter  accuracy. 
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4.3  Example  2 

In  this  example  a  new  nonlinear  filtering  and  tracking  technique 
[K3],  which  Is  similar  to  the  TNF,  is  applied  to  a  passive-sonar-two 
dimensional  problem.  This  technique  Is  suited  for  the  type  of 
problem  under  consideration,  where  the  observation  equation  1$ 
scalar.  Thus,  application  of  the  TNF  will  encounter  difficulties 
with  respect  to  control  policy  calculation.  However,  this  new  method 
does  not  require  control  calculation,  but  unfortunately,  the  global 
properties  are  lost. 

4-I-a.  Problem  Definition  and  Model  Fonulatlon 

This  problem  describes  the  two-dimensional,  bearlngs-only  target 
motion  [A2].  Figure  (4-17)  presents  a  geometric  configuration  of 
both  the  target  and  the  observer,  where  It  Is  assumed  that  both  lie 
In  the  same  horizontal  plane. 

It  Is  assuied  that  the  system  behavior  evolves  according  to  the 
following  stochastic  differential  equations: 

&  *  A  xt  +  B  ut  ♦  G  wu  ,  (4-26) 


where 


i 


Xj,  relative  range  component  In  the  x  direction 
Xj,  relative  range  component  in  the  y  direction 
Xj,  relative  velocity  component  in  the  x  direction  * 
x^,  relative  velocity  conponent  In  the  y  direction 


u  « 


uox»  relative  acceleration  In  the  x  direction 


uy  -  u0y,  relative  acceleration  in  the  y  direction 
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ux,  Uy  are  acceleration  of  the  target  In  the  (x,y)  direction 
respectively. 


0  0  10 

0  0  0  1 

B  - 

0 

0 

0 

0 

G  » 

—  «m 

0  0 

0  0 

0  0  0  0 

9 

-1 

0 

» 

Rj  0 

0  0  0  0 

0  -1 

m  «■ 

0  r2 

*  «• 

Here  wit,  (wltx,  wlty),  Is  an  additive  noise  to  model  random  target- 
acceleration  fluctuations  from  the  assumed  constant  velocity 
trajectory,  (with  ux  »  uy  ■  0). 

The  measurement  data  consists  entirely  of  passive  sonar  bearings 
as  follows: 

S(t)  ■  Arctan  Cx^tl/XgU)]  +  on(t)  ,  (4-27) 


1 


where  6 ( t)  fs  the  measured  target  bearings,  and  n(t)  Is  an 
Independent  additive  Gaussian  measurement  noise  and  has  variance 
o^(t).  Moreover,  the  observation  Is  almost  specially  continuous  If 
we  consider  the  scenario  for  an  Isolated  submarine  tracking  a  surface 
ship  or  far  distance  submarine.  Now,  If  we  let  <ty  *  Bdt  then  (4-27) 
can  be  written  as 


dy  *  H(x^,x2,t)  dt  +  <t  dwg  , 


(4-28) 


where  dw2  *  dt  Is  a  Wiener  process,  M( t) 


tan“1( 


Xt(t) 

x^TTF 


4-3-b.  Filtering  Algorithm 

The  system  In  (4-26),  (4-28)  Is  a  nonlinear  system  with 

nonlinear  obs*<'vat1ons.  In  [A2],  [Cl],  It  has  been  shown  that  the 
EKF  suffers  from  the  "111 -conditioning"  phenomena  due  to  the  error 
covariance-matrices  false  observability  [A2].  The  new  approach  [K3] 
proceeds  as  follows:  First,  the  approximating  system  Is  defined  with 


fl  (Xj.Xg.y.t) 


hg(y)  +  Chjty),  h2 (y ) 3 


(4-29) 


Now  assuae  the  following: 

(1)  If  f  denotes  any  of  the  functions  F,  B,  G,  hg,  hj,  h2,  then 
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P(  /T|f|  dt  <  •)  *  1; 

0 

II)  xg  given  yg  Is  conditionally  Gaussian; 

III)  hg,  hj,  h2  are  yt  measurable. 

Then,  an  approximating  model  for  the  target  and  receiver  equations 
(4-26),  (4-28)  Is  presented  by: 

dx  *  (Ax  +  Bug)  dt  +  G  dwj  , 

(4-30) 

dy  -  (hg  +  kx)  dt  +  a  d  w2  , 

where  k  »  (hj,  1^1,  w^,  w2  are  Independent  Wiener  processes,  Ug  * 
(“Ox*  u0y)»  ^e  observer  acceleration.  The  arguments  (xlt  x2,  y,  t), 
(y)  are  oorltted.  Also,  the  assunptlons  significantly  provide  that  in 
(4-30)  xt,  yt  are  conditionally  Gaussian.  Hence,  from  (2-12),  (see 
Chapter  2-Section  2-3),  the  recursive  formulas  for  xt  (the 
conditional  mean),  and  rt  (the  conditional  covariance)  are: 

d*t  »  (AX  +  Bug)  dt  +  i  Cdy  -  (hQ  +  k*t)  dt]  , 

(4-31) 

drt  -  (Art  +  rtA*  +  gg*  -  ss*)  dt  , 

* 

rji 

where  S  »  —  ,  M  »  [h^,  h2,  0,  0]  . 
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The  recursive  formulas  In  (4-31)  will  completely  characterize  the 
filter  equations  If  h0,  hjt  h2  are  obtained.  Now,  using  the 
following  mean-square  criterion: 

J(u)  »  E(| IF  -  F| |2)  .  (4-32) 

Now,  from  (4-28),  (4-30)  equation  (4-32)  can  be  written  as 

J(u)  *  E  { <H  -  (hg  +  kx)(H  -  (hg  +  x*  k*)} 

-  E  {Et(Hz)  +  hg2  +  kE*  (xx*)  k*  -  2hg  E^fH) 

+  hg  kEt(x)  +  hgE^x*)  k*  -  Et(Hx*)  k*  -  KEt(Hx)}  , 

(4-33) 

where  again  Et  (  )  1"  the  conditional  expectation  operator. 

Let 

a  ■  Et(xx*)  p*1, 

b  -  p*1. 


c  »  Et(x*)  p"1. 


(4-34) 
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d  »  Et(H*)-(Et(H))2-  Et{x*}  Et(Hx*)  E*(H)  +  Et(x)  Et(Hx)  Et(H) 

-Et(Hx*)  Et(Hx)  -  Et(x)  Et(x*)[Et{H)]2  P"1, 

eQ  -  h0  +  k  Et(x)  -  Et{H)  , 

tsj  »  hQ  Et(x*)  +  k  Et(xx*)  -  Et(Hx*). 

Here  P  Is  a  2x2  positive  definite  matrix. 

Now,  (4-33)  can  be  written,  using  (4-34),  as 

E  {(a  ege0*  +  b  e^*-  c  e^*-  e1eQ*c*  +  d)}  .  (4-35) 


Since  ab  >  c*c,  then  the  minimization  of  (4-35)  requires  that  e0  »  0, 
e^  *  0.  Thus, 

h0  •  Et(H)  -  [E^Hx*)  -  E*(H)E(x*)]  P"1  E*(x)  , 

(4-36) 

k  «  [Et(Hx*)  -  Et(H)  Et(x*)]  p“l  -  [hj.hg]  . 

Hence,  from  (4-36),  (4-29)  becomes 

fi  (xpXg.y.t)  -  Et(H)  +  [Et(Hx*)  -  E*(H)  Et(x*)]p'1(x-Et(x))  . 

(4-37) 

To  evaluate  the  conditional -expectation  terms  In  (3-35),  the 
following  approximation,  (see  (2-13),  (2-14)),  can  be  used. 
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Let  f  denote  either  of  the  functions  (H,  Hx  ),  then. 


Et(f)  *  /  f(s,yt,t)  d  «  (x,  rt,  O  =  f  (x.  y,  r,  t)  , 
R 


(4-38) 


where  d  ♦  (Xt,  r,  5)  Is  a  differential  Gaussian  measure  as  defined 
In  (2-14).  Thus,  using  (4-38),  (2-14) 

Et(H)  *  /  /  arc  tan  (_!)  y  Exp  -  0.5  C(x-x)  r"1(x-x)]  dXjdx2  , 

(4-39) 


E*(Hx*)  -  [Et(Hx1) ,  E^HXg)]  ,. 


(4-40) 


n 

E^(HXj)  »  y'lj.  |  /  /  Xjarctan -ji  Exp  -  0.5  [(x-X)*!1'1  (x-X)]  dxjdx2. 


(4-41) 


E^HXg)  «  2^'  -jf  | /  /  x2  arctan(-^i)Exp  -  0.5  [(x-x)*  r-1(x-x)]  dxjdx2. 


But,  analytical  evaluation  of  (4-39),  (4-41)  Is  very  difficult. 
Thus,  the  following  approximation  scheme  Is  used,  where  the  arctan 
function  Is  first  expanded  by  Taylor  series  for  a  function  of  two 
variables,  then  find  the  conditional  expectation  for  each  terms  using 
equation  (4-38),  (2-14).  Thus,  using  the  Taylor  series  expansion  up 


1 

1 
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X  X 

Et(x2  arctan  ■jlO  »  XgE^arctan  -I) 


+  Ag  +  +  Cj 


„  2  ; 
r2  X1 


+  c2  r2  r12  x2  +  c3  x22  Xj  , 


where 


Cr, 


x2-  r12 


xl3 


*2  “ 


Cr12  x2 


2  *1- 


(r, 


r2  + 


2r 


12 

T 


!)(x,3-  x, 


«l* 


3522) 


>-  2 


Oxj"  -  x^) 


-3  r3  Xx  Ct  , 


-3  r2  r3  x-2  c2 


Therefore,  from  (4-42)  and  (4-43),  (4-36)  becomes 
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(4-43-b) 


(4-43— c) 


(4-43) 


hl  “  dl^l  r2  “  k2  r12] 


(4-44) 
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*>2  *  dj  [k2  r  j  -  kj  r12], 

where 

kl  «  [Ax  +  Bx  +  cx  Xj  Tl  r3  ♦  C3  ♦  c2  x2  rx2]  , 
k2  -  CA2  ♦  Bj  ♦  C2  ^  rx  r3  +  c4  +  4  Sj  r22]  , 

<*i  «  Ci/0*i  r2  -  r122)]  , 

^1*  ®1*  ^l*  ^2*  ^3  defined  In  (4-43).  Butf  In  (4-31) 

0 

s  *  7  Crt  Mt]  •  M  *  £hi*  h23* 

Then,  fro*  (4-44), 

rl  hl  +  r12  h2 
r21hl  +  r2  h2 
r13hl  +  r23  h2 
r14hl  +  r24  h2 

Thus,  from  (4-45),  the  filter  equations  (4-31)  become 
dXt  *  (A  Xt  +  Bu0)  dt  +  |  (dy  -  Et(H)  dt)  , 
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d  rt  -  (A  rt  +  rt  A*  +  GG*  -  s$*)  dt  , 

where  S  Is  given  by  (4-45),  E^H)  by  {4-42),  and  A,  6, 
t  Here  rt  Is  a  4x4  matrix,  while  x  Is  a  4x1  vector. 


1 


4-3-C 

The  corresponding  EKF  equations  are 

* 

P  N  x 

dxt  ■  (A  SL  ♦  Bu  )  dt  +  (dy-arctan  -i-  dt)  , 

a  x2 

d  Pt  -  (A  Pt  +  Pt  A*  +  GG*  -  V-^!!*)  dt  , 


where  N  * 


3  Ht  (x,t) 


Tx: 


*  x22)2 

-X  J2  /(Xj2-*-  x22)2 

0 

0 


t 


and  (A,  B,  G,  a)  are  as  defined  In  (4-26). 


L 


(4-46) 


by  (4-26). 


I 

(4-47)  i 


4-3-d  Simulation  Results 

The  passive  sonar  problem  described  by  equation  (4-26),  (4-28) 
and  tiie  estimated  algorithm  by  both  TNF,  EKF  which  have  been 
described  by  (4-46),  (4-47)  respectively ,  are  simulated  on  the 

digital  computer. 

To  compare  the  estimator's  performance,  the  following  scenario 
Is  devised.  The  target  Is  at  an  Initial  range  of  2700  yards,  moves 
at  a  constant  speed  of  675.13  yards/ml n. ,  and  maintains  a  steady 
course  of  0°.  In  addition,  the  Initial  bearing  Is  0°.  Own-ship  Is 

assumed  to  be  at  the  origin  Initially,  maintains  a  constant  speed  of 

954.63  yards/ml n.,  but  periodically  executed  90°  course  changes  as 
follows: 

from  45°  to  -45°  at  t  »  (4  +  17k)  [k  -  0,1]  , 

from  -45°  to  45°  at  t  -  (12.5  +  17k)  [k  *  0,1]. 

The  own-ship  course  changes  at  the  rate  of  3°/second. 

Numerous  simulation  tests  were  conducted  for  this  problem  using 
different  levels  of  measurement  noises  and  initial  conditions.  But 
only  representative  sampling  of  the  result:  have  been  presented 
here.  These  results  are  tabulated  In  Table  3  and  are  shown  In 
Figures  (4-18)  -  (4-25)  for  additive,  rms,  measurement-noise  levels 
of  3°,  12°,  respectively,  and  JJj,  J0£  are  as  defined  in  (4-22). 

The  initial  state  values  were  respectively 
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x( 0)  ■  [0,  2700,  -  675.13,  0]  , 

while  the  estimates  were  Initialized  according  to  the  following 
equation 

Xf(0)  •  Xj(0)  [1  +  .01  nj],  i  »  1,2, 3, 4, 
where  n^  Is  a  random  noise.  The  initial  covariance  matrix  was 
p(0)  •  dlag  [106,  106,  152,  152]  . 

Moreover,  In  all  the  simulation  cases,  system  noises  are  added  to  the 
velocity  states  (X3,  x+)  to  compensate  for  random  target-acceleration 
fluctuations  from  the  assumed  constant-velocity  trajectory  {uxt  »  uyt 
■  0). 

For  convenience,  the  time  Interval  for  each  run  Is  20  minutes, 
and  the  number  of  runs  for  each  simulation  test  case  Is  10  runs. 
Thus,  all  results  have  been  ensemble  averaged  over  10  runs. 

The  relative  range  trajectory  and  Its  corresponding  estimates  by 
both  the  TNF,  EKF  were  simultaneously  plotted  In  Figures  (4-18)  -  (4- 
19)  for  the  ms-noise  levels  of  3°,  12°  respectively.  And  the 
relative-velocity  trajectory  and  Its  estimates  were  shown  In  Figures 
(4-20),  (4-21)  respectively.  Figures  (4-22M4-23)  showed  the  range 
rms -estimation  errors  (Q^ft),  Q2U))  associated  with  TMF,  EKF 
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respectively  and  calculated  In  accordance  to  equation  (4-23)  for  the 
same  noise  levels  as  above.  In  addition,  figures  (4-24),  (4-25) 
showed  the  velocity  rms  estimation  errors  (VQjft),  YQ2(t))  calculated 
by  equation  (4-24)  for  TNF,  EKF  respectively. 

Several  conments  can  be  drawn  from  Table  3  and  Figures  (4-18) 
through  (4-25).  First,  Figures  (4-18)  -  (4-21)  showed  that  state 
estimates  begin  converging  to  their  true  values  after  own-ship 
executes  a  maneuver  relative  to  target  motion. 

Figures  (4-19),  (4-21),  (4-23),  (4-25)  showed  the  effect  of 
Increasing  the  measurements  error  to  an  rms  value  of  12°.  The  EKF 
performance  Is  Improved  while  the  TNF  performance  Is  degraded  a 
little,  but  still  the  TNF  performs  significantly  better  than  the 
EKF.  The  above  conclusions  are  also  demonstrated  by  Table  3. 

Finally,  It  Is  generally  noted  that  the  covariance  becomes 
smaller  with  decreasing  range  and  grows  as  range  Increases,  This  was 
typical  In  all  of  the  simulation  cases,  and  pointed  out  the 
Involvements  of  nonstationary  processes.  It  Is  also,  noteworthy  to 
point  out  that  when  own-ship  does  not  maneuver  at  all,  both  filters 
diverge  (generating  biased  range  estimates),  but  the  EKF  diverges 
faster  than  the  TNF. 

In  closing,  the  following  remarks  are  perhaps  In  order: 

(1)  In  general,  the  TNF  rms  errors  are  much  smaller  than  their 
corresponding  EKF  rms  errors.  This  demonstrates  that  the  TNF 
Is  more  effective  than  the  EKF. 


j 
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(II)  Own-ship  maneuvers  relative  to  a  constant-velocity  target 
enhance  convergence.  Thus  own-ship  maneuver  or  target  maneuver 
Is  essential  to  bearings  only  measurement  analysis. 

(III)  Computer  utilization  cost  and  storage  requirements  are  almost 
equal  for  both  algorithms.  Thus  computation  efficiency  Is  not 
a  decisive  factor  In  this  application. 

(1v)  Finally,  these  results  admittedly  are  not  exhaustive.  Thus, 
more  simulation  tests  are  needed  especially  for  the  case  when 
the  target  Is  maneuvering.  Also,  comparison  (via  digital 
simulation)  of  the  THF  with  the  MP  (Modified  Polar  Coordinates) 
filter  developed  by  Adlala  [A3]  warrant  further  consideration. 


t 
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TABLE  1 

Synopsis  of  the  Percentage  Accuracy  of  TNF  over  EKF 


Cl  a 

JJX% 

JJ2* 

1.0  1 

10.91 

35.57 

1.0  1.5 

32.37 

57.14 

1.0  2 

78.60 

90.03 

1.0  3 

91.42 

98.85 

1»  2%,  a 2“  10%,  Xi(0) 

*  5xl03m,  X2(0) 

*  103  m/sec) 

TABLE  2 

The  Effect  of  Measurement  Errors  on  thePercentage 
Accuracy  of  the  TNF  over  the  EKF 


a 

3 

3 

3 

ffl 

2% 

10% 

20% 

a2 

10% 

10% 

10% 

JJX% 

91.42 

11.0 

-7.23 

JJ  2% 

98.03 

17.47 

-16.13 

(cj«1.0,  X2(0)»10^  m/sec,  Xj(0)«5xl0^m) 
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TABLE  3 

Synopsis  of  the  Percentage  Accuracy  of  TNF  Over 
EKF  as  a  Function  of  the  Measurement  rms  Error 


a  degree 

2° 

3° 

6° 

12° 

JJi* 

32.35 

38.48 

36.84 

13.69 

f 

JJ2 

98.21 

97.92 

98.71 

98.88 

|  • 
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FIGURE  4-1  RANGE  AND  ITS  ESTIMATES  BY  TNF,EKF 
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FIGURE  4-2.  RANGE  AND  ITS  ESTIMATES  BY  TNF,EKF 
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FIGURE  4-3  RANGE  AND  ITS  ESTIMATES  BY  TNF.EKF 


FIGURE  4-4  RANGE  AND  ITS 
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FIGURE  4-5  VELOCITY  AND  ITS  ESTIMATES  BY  TNF,EKF 


FIGURE  4-6RANGE  R.AND  ITS  ESTIMATES  BY  TNF ,EKF 


FIGURE  4-7  VELOCITY  AND  ITS  ESTIMATES 


FIGURE  4-8  RANGE  RATE  AND  ITS  ESTIMATES  BY  TNF,E»* 


FIGURE  4-9  RANGE  RMS  ERRORS  BY  TNFjEKF 
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FIGURE4-10  RANGE  RMS  ERRORS  BY  TNF,EKF 
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FIGURE  4-11  RAGE  RMS  ERRORS  BY  TNF,EKF 
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FIGURE  4-12  RANGE  RMS  ERRORS  BY  TNF,EKF 


FIGURE  4-13  VELOCITY  RMS  ERRORS  BY  TNF,EKF 


FIGURE  4-14  VELOCITY  RMS  ERRORS  BY  TNF,EKF 


FIGURE  4-15  VELOCITY  RMS  ERRORS  BY  TNF,EKF 
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FIGURE  4-16  VELOCITY  RMS  ERRORS  BY  TNF,EKF 
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TIM  (MlNIfTtU) 

FIGURE  4-18  RANGE  AND  ITS  ESTIMATES  BY  TNF,EKF 
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FIGURE  4-19  RANGE  AND  ITS  ESTIMATES  BY  TNFtEKF 
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TIME  (MINUTES) 

FIGURE  4-20  VELOCITY  AND  ITS  ESTIMATES  BY  TNF,EKF 


ram  <mmmea> 

FIGURE  4-21  VELOCITY  AND  ITS  ESTIMATES  BY  TNF,EK 
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FIGURE  4-22  RANGE  RMS  ERRORS  BY  TNF,EKF 
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FIGURE  4-23  RANGE  RMS  ERRORS  BY  TNF.EKF 


FIGURE  4-24  VELOCITY  RMS  ERRORS  BY  TNF,EKF 
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FIGURE  4-25  VELOCITY  RMS  ERRORS  BY  TNF,EKF 
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Chapter  5 


5.  SIMMARY  AND  CONCLUSION 


In  this  research  an  original  nonlinear  filter  approximation  Is 
developed  for  a  class  of  nonlinear  systems.  The  need  for  such  a 
filter  which  does  not  suffer  from  the  shortcomings  of  most  of  the 
linearization  techniques,  such  as  model  smoothness,  is  encountered 
frequently  in  many  Industrial,  sonar,  economic,  and  image-processing 
applications.  Obviously,  the  design  of  a  filter  which  has  Improved 
performance,  without  stringent  requirements,  and  which  is  employed 
with  comjarable  Implementation  cost  to  the  traditional  techniques, 
would  be  a  very  significant  achievement.  This  has  been  the  major 
design  goal  of  the  work  presented  herein. 

5-1  Significant  Features  of  the  New  Filter 

The  effort  was  devoted  to  the  development  of  a  new  finite¬ 
dimensional  filtering  approximation  to  the  typical  infinite- 
dimensional-nonlinear  filtering  problem.  As  a  result  of  this  effort, 
new  features  were  developed,  and  a  modest  contribution  to  nonlinear- 
filtering  approximation  theory  was  achieved.  Among  these  are  the 
foil  owl ng: 

1)  Weaker  assumptions  are  required  to  derive  the  filter  even  as  the 
solution  Is  assumed  to  be  a  weak  solution. 

2)  Impeding  generation  of  a  "close",  (in  m.s.e),  bilinear,  feedback- 
control-model  approximation  to  the  original  nonlinear  system. 
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This  has  an  Impact  on  nonlinear  approximation  theory,  since  It 
provides  formal  approximation  procedures  with  flexibility 
embedded.  That  Is  due  to  the  fact  that  the  appproxlmatlng 
parameters  are  functionals  of  the  feedback  control  ut  which 
provides  some  control  and  Insight  Into  the  structure  of  the 
approximating  model. 

3)  The  feedback  coupling  In  the  filter-covariance  equations  enhances 

the  stability  of  the  filter.  In  all  of  the  linearization 

techniques  only  forward  coupling  occurs  in  the  filter  equations 
and  that  In  essence  helps  to  destabllze  the  filter  due  to  error 
accummulatlon. 

4)  An  Important  feature  of  the  new  filter  Is  the  "global"  property 
which  allows  the  filter  to  be  independent  of  the  local -time 
discretization,  and  thus  decreases  the  noise  aggravation. 

5)  A  new  "decomposition  scheme"  Is  developed  for  a  certain  class  of 
large-scale  nonlinear  systems.  This  results  In  great 
mathematical  simplification  In  the  development  of  global 
nonlinear  multi-dimensional  filter. 

6)  This  filter  provides  an  alternative  formal  approach  to  certain 
traditional  techniques,  {such  as  MSOF  [Jl],  and  "the  Smoothness 
In  Probability  Filter  [K4]),  for  certain  nonlinear  systems  even 
though  the  approaching  method  Is  quite  different. 
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5-2  Future  Area  of  Research 

The  following  five  main  areas  warrant  further  study: 

1)  The  development  of  an  equivalent  filtering  algorithm  for  the  case 
when  the  observation  process  Is  discrete.  Thii  is  specifically 
Important  In  sonar  applications  since  observations  may  be 
received  as  batches  it  random  times.  The  following  cases  are  of 
most  Interest: 

a)  the  observation  Is  discrete  but  random; 

b)  both  the  observation  and  the  system  processes  are  discrete 
and  random. 

2)  Investigation  of  a  general  "Deconposltion  Scheme"  to  extend  the 
applicability  of  the  new  filter  to  a  broad  class  of  nonlinear 
systems.  The  use  of  a  hierarchical  optimality  scheme  Is 
promising. 

3)  Computer  utilization  is  an  Issue  that  should  be  more  fully 
explored.  Thus,  development  of  a  standard  software  package  for 
the  TNF  algorithm  would  make  the  filter  quite  attractive  to 
practical  applications.  Furthermore,  Incorporation  of  the 
ml croprocessor  would  Increase  the  economical  feasibility  of  the 
filter. 

4)  The  need  for  further  testing  and  conparfson  with  other  existing 
techniques  to  establish  filter  superiority  In  most  cases,  and  to 
enhance  Its  reliability. 


•  I Mtniiwi  *•«*>■**-*  '.ft 


5)  Investigation  of  the  use  of  "adaptive"  techniques  to  surmount  the 
difficulty  of  tracking  successfully  an  evasive  maneuvering 
target,  which  Is  a  typical  and  Important  subject  In  sonar 
applications.  The  following  techniques  are  promising: 

a)  use  of  an  additive  control  that  can  be  estimated  separately 
using  the  separation  principle  for  stochastic  conditionally 
Gaussian  systems  [Kl]; 

b)  use  a  feedback  scheme  to  correct  for  the  target  deviation 
from  Its  constant  trajectory  due  to  maneuvering. 

5-3  Conclusion 

This  dissertation  has  examined  and  expanded  the  subject  of 
nonlinear-filtering  approximation.  A  new  global -filtering  approxi¬ 
mation  procedure  has  been  developed  with  a  particular  emphasis  on  non 
Gaussian  processes.  An  Important  practical  feature  of  the  proposed 
method  1  s  the  method1  s  1 ndependence  of  the  model  smoothness 
assumption  which  Is  crucial  to  traditional  techniques.  Furthermore, 
a  major  and  equally  Important  byproduct  Is  the  generation  of  a 
"closed"  (In  the  mean-square-error  sense)  bilinear  model 
approximation  of  the  original  nonlinear  system. 

The  estimation  method  developed  herein  has  been  applied 
specifically  to  practical  problems  to  demonstrate  its  effectiveness 
and  applicability  to  a  variety  of  a  certain  class  of  nonlinear 
systems.  In  fact,  the  degree  of  ease  or  difficulty  In  extending  Its 
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applicability  to  general  multi -dimensional  nonlinear  systems  Is 
directly  related  to  the  ease  or  difficulty  In  calculating  the 
multiplicative  feedback  control -laws,  and  accordingly  In  evaluating 
the  required  conditional  expectation  of  the  nonlinear  terms.  This  Is 
often  proven  to  be  difficult,  especially  for  large-scale  nonlinear 
systems.  However,  for  low-order  nonlinear  systems,  this  difficulty 
has  been  surmounted  by  a  novel  decomposition  scheme.  This  scheme 
alleviates  the  control  problem  calculation,  and  permits  a  reduction 
of  the  multi -dimensional  Integrations  associated  with  evaluation  of 
the  conditional  expectation  expressions  to  only  a  single  Integration 
which  can  then  be  evaluated  analytically.  It  Is  noteworthy  to  point 
out  that  a  similar  approach  which  Is  developed  In  [K3]  and  has  been 
used  In  the  second  example,  does  not  require  any  control -law 
calculation  but  has  no  global  filtering  properties  either.  However, 
the  method  requires  evaluation  of  the  conditional  expectations  with 
corresponding  multi -dimensional  integrations. 

A  fundamental  limitation  Imposed  on  the  new  approach  was  the 
conditionally -Gaussian  assumption  of  Xg  given  yg.  This  restriction 
Is  very  basic,  because  the  filtering  techniques  herein  are  completely 
dependent  on  the  choice  of  the  statistical  model  for  the  underlying 
random  processes.  However,  as  pointed  out  earlier,  this  assumption 
sometimes  may  be  satisfied  under  realistic  operating  conditions, 
and,  of  course.  It  Is  more  general  than  the  traditional  Gaussian 
assuaptlon  of  both  xt  and  yt. 
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The  digital -computer  simulation  clearly  demonstrates  the 
efficiency  and  filtering  accuracy  Improvement  of  the  TNF  over  the 
popular  EKF.  Thus,  the  apparent  complexity  of  the  algorithm  and  the 
slight  Increase  In  confutation  cost  might  be  justified  by  the 
significant  Improvement  In  the  filter  performance.  But,  no  claim  has 
been  made  that  this  filter  Is  superior  to  all  other  existing 
techniques  In  all  cases.  That  certainly  warrants  further 
Investigation. 

Finally,  the  methodological  formulation  developed  In  this  work 
Is  Intended  to  generate  further  Interest  and  Insight  Into  the  design 
of  a  future  filter  for  general  nonlinear  systems.  Moreover,  It  Is 
hoped  also,  that  this  filter's  potential,  as  an  effective  filtering 
algorithm  for  a  nonlinear  system,  should  be  fully  explored  by 
practical  application  to  communication  and  tracking. 


1 


1 
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APPENDIX  A 

WIENER-PROCESS  FORMULATION  AND  WHITE-NOISE  FORMULATION  EQUIVALENCE 
Theorem  A1  Levy  (LI,  Thm  4.1,  pp.82)  defines  a  Wiener  process  as 
follows: 

Let  (a,F,p)  be  a  probability  space  and  (Ft),  te[0,T]  be  a 
nondecreasing  family  of  sub-c-al gebras  of  F.  The  random  process 
(wt,Ft),  te[0,T],  Is  called  a  Wiener  process  If 

(I)  the  trajectories  wt,  te[0,T]  are  Gaussian,  continuous  (p.a.s.) 
on  C0,T], 

(II)  wt,  te[0,T]  Is  a  square-lntegrable  martingale  [A4]  with  wg*0, 
p.a.s.  and  E[(wt-ws)(Wt-w$)*]»(sAt)I,  t>s 

Thus,  any  Wiener  process  Is  a  Brownian  motion  process  [H],[A4]. 

Definition  Al[A4] 

A  sequence  of  quadratic  mean  square  continuous  [xtn],  te[— ,«•] 
Is  said  to  converge  to  a  white  noise  If  for  each  function  f(t),  g(t), 

m 

(  /  If  (t)ldt<«),  there  exists  a  positive  constant  Sg,  l.e. 

limit  E  (  /7  f(t)  g(s)  xtnx$ndt  ds)  •  SQ  /"  f(t)  g(t)  dt. 

-m 

As  pointed  out  earlier,  a  stochastic  differential  equation  with 
an  additive,  exciting,  whlte-Gausslan  noise  Is  given  by 

»  f(xt,t)  ♦  G(xt,t)  st  . 


(A-l) 


Here,  st  Is  a  white  noise,  and  Is  thus  neither  mean  square  Rlemann 
Integrate  nor  Integrate  with  probability  one.  Hence  equation  (A-l) 
Is  not  mathematically  meaningful  as  It  stands.  However,  If  the  white 
noise  Is  as  defined  In  (A1 )  and  Is  considered  as  the  formal 
derivative  of  the  Wiener  process  (wt,  tcT),  (Thm.Al),  then  (A>1)  may 
be  considered  formally  equivalent  to 


dxt«  f(xt,t)dt  +  G(xt,t)dwt,  tcT. 


(A-2) 


-  W 

At  least  formally  It  Is  known  that  /  Btds  has  all  the  properties  of 

0 

Brownian  motion,  wt.  Hence,  (A-l)  can  be  made  meaningful  In  terms  of 
a  stochastic  Integral  equation  [02]. 


xt*  *n  +  JTf(x_,s)ds  +  /TG(x.,s)dw 
1  u  0  0 


(A— 3) 


The  last  Integral  in  (A-3)  Is  Interpreted  as  a  stochastic  Integral 
which  needs  to  be  defined.  Since,  as  It  may  be  recalled,  wt  has  a 
realization  of  unbounded  variation  In  any  small  Interval  of  time,  the 
last-integral  cannot  be  defined  In  the  usual  Lebesque-Stleltjes 
sense.  One  generally  accepted  definition  Is  due  to  Ito  [LI]  and  Is 
often  referred  to  as  an  Ito  stochastic  Integral,  and  (A-2)  1$  called 
the  Ito  stochastic  differential  equation  Interpreted  In  terms  of  the 
Ito  calculus  [A4]  which  Is  not  compatible  with  results  of  ordinary 
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calculus.  Accordingly,  the  following  definition  Is  appropriate. 


Definition  A2  [LI],  [Kl]. 

A  process  xt  Is  said  to  satisfy  (A-2)  for  t*[0,T],  with  Initial 
state  xq  If 

(I)  for  all  tc[0,T],  /tG(xe,s)dwe  can  be  interpreted  as  a 

0  s  5 

stochastic  Integral, 

(II)  for  all  te[0,T],  xt  Is  almost  surely  equal  to  the  random 
variable 


t  t 

x0+  /  f(x  ,s)ds  +  /  G(x  ,s)d*  . 
u  0  s  0  s  5 

Under  certain  conditions  Imposed  on  Xg,f,G  [LI],  (A-2)  has  a  unique, 
strong/weak,  sample-continuous,  Markov  solution.  The  strong-solution 
notion  Is  defined  as  follows 


Definition  A3  [LI] 

For  *  given  complete  probability  space,  (a,c,p),  and  a  Wiener 
process  wt,  the  stochastic  differential  equation  (A-2)  has  a  strong 
solution  xt  If: 

1)  P(/tMf(x1.,t)l|dt<-)  -  1 
0  * 


( A— 4) 
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T  ? 

2)  P(  /  llG(x.,t)||z  dt<-)  -  1  (A-5) 

0  1 

3)  Condition  (11)  of  Definition  A2  Is  satisfied.  (A-6) 

That  Implies  that  ctx«  ctw  > 

where  ct*«  a -algebra  [xs;  0<s4t], 

■  a-algrebra  [ws;  0<s<t]. 

The  weak  solution  notion  Is  defined  as: 

Definition  A4  [LI]. 

Let  F(a)  ■  P(nia),  probability  measure  for  n  random  variables. 
If  there  exist  (Q,c,p),  xt,  p-a.s.  continuous,  conditions  (A-4),  (A- 
5),  (A-6)  are  satisfied,  and  F(a  »  PUg^ag).  Then  xt  Is  a  weak 
solution  to  (A— 2)  which  Implies  ctx>  Ctw,  and  ctx,  ctw  as  defined 
previously. 

It  Is  noteworthy  to  point  out  that  the  use  of  Ito  calculus, 

t 

vrfilch  led  to  the  definition  of  the  Ito  Integra'  /  G(x  ,s)dw  in 

0 

equation  (A-3),  results  In  adding  a  correction  term  to  the  result  of 
the  ordinary  differentiation  rules  when  stochastic  process  Is 
differentiated.  This  will  help  In  transforming  the  differential  dxt 
as  In  equation  (A-2),  Into  a  form  that  can  hopefully  be  recognized  as 
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the  differential  of  a  known  function.  For  example,  consider  the 
following  stochastic  differential  equation. 

dx  *  xdw,  (A-7) 

with  initial  conditions  x{0)  *  1,  w(0)  *  0,  and  E(dw(t)^)  *  dt. 
Using  ordinary  integration  rules  or  Stratonovich  rule,  the  analytical 
solution  is 

x(t)  *  Exp  w(t)  .  (A-8) 

However,  using  Ito  integral,  the  solution  is 

x ( t )  «  Exp  [w(t)  -  0.5t],  ( A— 9) 

which  is  actually'  the  integral  of  the  following  stochastic 
differential  equation 

dx  *  x  dw  +  0.5  x  (dw)*,  ( A— 10) 

where  (dw)^  «  dt. 

The  last  term  In  equation  (A-10)  Is  what  is  referred  to  as  the 
correction  term.  Thus,  the  addition  of  the  correction  term  to 
equation  (A-7)  will  make  It  conpatlble  with  ordinary  rules  of 
differentiation  and  Integration. 

In  simulating  equation  (A-3)  or  Its  differential  equivalence 
(equation  (A-2))  special  care  must  be  given  to  the  simulation  of  the 


term  /B(xs,s)dws  because  (dw)  Is  highly  uncorrelated.  Thus,  It  is 
difficult  to  find  an  appropriate  integration  step-size  at.  However, 
the  following  approximation  scheme  is  appropriate. 


Let  R  be  the  maximum  rate  of  change  and  defined  as 


!HCJr 


|F(x,t)| 
“ T  |xl 


G2(x,t) 
1  +  xz 


( A— 11 ) 


Now  let  at«J  ,  e  {dw)2-  at. 

Then,  equation  (A-2)  or  equation  (A-3)  can  integrated  using 
digital  computer  as  follows 


*k+l"  ’Sc  +  At  F^xk*  +  A"  S^xk» 

*  °-5  “  G(V  V  — inb5-  • 


(*-12) 


Hence,  the  last  term  in  equation  (A-12)  is  the  correction  term  which 
is  essential  for  the  simulation  of  the  Ito  differential  or  (Integral) 
equation  by  digital -computer. 

Let  us  return  for  a  moment  to  equation  (A-3)  and  try  to  find 
some  physical  Interpretation  to  It.  If  x(t)  Is  the  state  of  a 
dynamical  system,  then  the  terms  on  the  right-hand  side  of  equation 
(A-3)  can  have  a  nice  Interpretation.  The  term  xq  Is  just  the 
Initial  condition.  The  first  Integral  describes  the  evolution  of  the 
component  of  the  state  with  time.  The  second  Integral  can  be 


i 


128 


considered  as  the  Irregular  component,  which  Is  entirely  due  to 
noise.  Furthermore,  It  Is  well  known,  {using  practical  engineering 
assumptions)  that  any  continuous  noise  can  be  regarded  as  a  smooth 
transformation  of  a  standard  Wiener  process.  Unfortunately,  the 
standard  Wiener  process  Is  not  differentiable  with  respect  to  time, 
hence  no  such  <W  exists.  Thus,  only  the  stochastic  Integral  as  In 
equation  (A-3)  Is  available  for  modeling  real  systems.  However,  as 
pointed  out  earlier.  If  the  white  Gaussian  noise  Is  considered  as  a 
formal  derivative  of  the  Wiener  process  then  the  stochastic 
differential  equations  (A-1),  { A-2)  would  have  a  particular  appeal  to 
engineering  applications.  Furthermore,  the  white-noise  concept 
allows  us  to  manipulate  the  Wiener  Integral  (which  Is  a  special  case 
of  the  Ito  Integral,  If  certain  conditions  about  the  function  G(xs,s) 
are  satisfied  l.e.  G(x$,s)  Is  square  Integrable)  as  an  ordinary 
Integral  but  not  In  the  same  sense  as  the  Stleltjes  Integral  Is 
defined.  It  also  allows  a  suitable  mathematically  tractable  model 
for  many  continuous  physical  noises  encountered  In  real  engineering 
systems. 

Mow  that  equations  (A-1),  (A-2)  and  (A-3)  have  been  presented 

,  formally,  let  us  discuss  briefly  the  use  of  such  equations  In 

i 

I  modeling  real  physical  processes  If  they  are  going  to  be  of  aqy 

practical  value.  As  It  Is  known  In  many  practical  problems  both  In 

i 

i  control  and  communications,  differential  equations  arise  from  the 

laws  of  nature,  but  It  Is  not  advisable  to  take  derivatives  of 
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certain  signals.  Thus,  It  Is  appropriate  to  model  these  signals  by 
so-called  stochastic  differential  equations.  In  addition,  almost  any 
mathematical  model  of  a  physical  process  Involves  a  degree  of 
Idealization  that  produces  a  good  match  with  reality  only  within 
certain  ranges  of  the  parameters  involved.  Thus,  the  result  of  such 
a  modeling  can  be  judged  only  by  conparison  with  practical 
experiments  within  the  prescribed  ranges  of  the  parameters 
Involved.  For  example,  the  erratic  motion  of  a  particle  or  point 
mass  submerged  In  a  fluid  caused  by  Impact  of  the  molecules  of  the 
liquid  on  the  particle.  The  force  acting  on  the  particle  can  be 
approximated  by 

-ax(t)  +  8  un(t).  (A— 13) 

where  x(t)  denotes  a  position  of  the  velocity  of  the  particle  at  time 
t.  Here,  the  first  term  In  (A-13)  represents  friction  or  drag,  while 
the  second  represents  the  push  Imparted  upon  the  particle  by  some 
projecting  force  which  Is  random  In  nature.  If  the  process  un(t)  Is 
replaced  by  white  Gaussian  noise  nt.  Then  the  motion  of  the  particle 
can  be  approximated  by 

x(t)  +  Jx(t)  t>0.  ( A-14) 

Here  m  is  the  mass  of  the  particle,  and  *(t)  Is  the  acceleration. 
Hence  we  arrived  at  the  famous  Langevin  equation.  Thus,  the  velocity 
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coordinate,  (with  certain  assumptions  about  the  Initial  conditions), 
will  be  an  Ornstein-Uhlenbeck  process  which  can  be  obtained  as  the 
solution  of  the  stochastic  equation 

dxt  *  -axt  +  cdwt,  (A-15) 

where  wt  Is  a  Wiener  process  and  a,c  are  positive  constants. 

The  relevance  of  the  model  (A-14)  or  equivalently  (A-15)  can  now 
be  tested,  for  example,  by  observing  to  motion  of  the  particle  and 
deciding  whether  the  statistic,  (mean,  variance),  of  the  displace¬ 
ments  of  the  particle  can  be  described  as  a  white  Gaussian  noise. 

The  practical  Implementation  of  the  concept  of  weak/strong 
solutions  depends  In  large  on  the  particular  application  at  hand,  and 
the  system  modeling  approach.  A  strong  solution  usually  deals  with  a 
"given"  Wiener  process  as  a  model  for  a  wide  spectrum  random  noise, 
while  a  weak  solution  Is  based  on  the  promise  that  there  exists  a 
Wiener  process  which  can  be  used  as  such  a  model.  Thus,  If  the 
physical  properties  of  a  given  problem  specifies  the  probability 
space  (a, c,p),  the  system's  possible  outcome  event  setfc^),  teCO.T], 
and  the  Wiener  process  w  »  (wt)  then  the  strong  solution  approach  Is 
appropriate.  On  the  other  hand.  If  the  physical  nature  of  the 
problem  does  not  specify  the  complete  probability  space  (a,c,p), 
then,  the  weak  solution  approach  might  be  more  suitable  as  a  modeling 
approach.  Thus,  we  may  construct  a  probability  space  (a,c,p),  a 
system  Ut),  tc[0,T],  and  a  Wiener  process  w«(wt),  for  which  (A-5)- 
(A-6)  are  satisfied  (p.a.s.),  to  satisfy  the  modeling  purpose. 
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APPENDIX  B 


AUXILIARY  RESULTS 
Lemma  B1  [D2] 

Let  Cxtn]  and  [ytn]  be  two  jointly  normal  processes  and  let 
E(x/yt)  *  x  «  PtYt*  the  Pr°Ject^on  x  onto  H ty  .  Then,  the 
conditional  characteristic  function,  (for  the  vector  case). 


ExpCjwSL-  0.5  i  s  *4*4^4 3. 
z  1-1  j-1  1  J 


where  w  «  (w^Wg,..^), 


( B— 1 ) 


xt  *  (x^,  x2,...xm)  ,  the  conditional  expectation, 

“u  *  “ji  *  £((xr  ««. 


the  condl tonal  covariance. 

The  proof  Is  an  extension  of  the  proof  given  In  [D2,  pp.  53]  for  the 
scalar  case,  and  Is  omitted  for  brevity. 


Lemma  B2,  [P2,  pp.  146] 

Let  [xj],  1*1, 2,... n  be  Jointly  normal  processes  with  the 
conditional  characteristic  function  f x/y (w)  as  given  In  Lemma  B2. 
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Then,  the  condi tonal  expectation  of  order  r  •  +  k2  +...kn,  Is 

r  .  ,w9...w_) 

J  E[(xlf...xn/ys;  s<t)]  » - y- - jr- - —p  (B-2) 

3w,  ...  n  w  *  0 

x  c  n 

The  proof  Is  parallel  to  the  proof  given  In  [P2]  using  the  above  two 
lemmas;  the  following  results  are  noted; 


E(x1x2x3/ys;  «t)  «  XjU23+  XgU^  x3u12+  ,  (B-13) 


11*1*2*?^*  *  "lptf  u13“24+  u14u23+  *1*2U34 


+  X1*3u24+  ^i*4u23+  *2*3u14+ 


x2x4ui3+  X3X4u14^  *1 *2*3*4' 


( B— 14) 


S<t)  *  \hh\h*  *l[u23u45+  “24u35+ 


+  *2^u13“45+  u14u35+  u15u34^  +  ?3^u15u24  +  u25u14 

+  u12u45^  +  *4Cu12u35+  u13“25+  u15“233 

+  ^5tu12u34+  u13u24+u14u23^ 

+  *1*2*3U45+  *1*2*4U35+  *1*2*5U34 

+  *2*3^4ui5+  *2*3*5u14+  *3*4*5u12 

+  *1*4*5U23+  *1*3*5U24+  *2*4*5U13 

♦  *1*3*  4^5  ^8*5J 

Here  Ulj-  E{ (x^ -x^ )(xj-Xj )*/ys;  sft)}  »  the  condition  covariance. 
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